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METHODS OF CONSTRUCTING AND SCREENING A LIBRARY OF 
POLYNUCLEOTIDE SEQUENCES OF INTEREST IN FILAMENTOUS FUNGAL 

CELLS 

Background of the Invention 

Held of the Invention 

The present invention relates to methods of constructing and screening a library of 
polynucleotide sequences of interest in Filamentous fungal cells. 

Description of the Related Art 

Filamentous fungi have been widely used as host cells for the commercial production of 
polypeptides. However, when it is desirable to produce a variant of the polypeptide with 
specified altered characteristics, e.g., thermostability, pH activity profile, specific activity, 
substrate specificity, K^, etc., the construction and screening of a library of variant en- 
coding sequences commonly requires the use of an intermediate host, e.g., bacterial cells or 
yeast, due to the low frequency of transformation and a variation in copy number among 
independently transformed filamentous fungal cells. 

Several methods for the construction of libraries of polynucleotide sequences of inter- 
est in yeast have been disclosed in which the libraries are screened in yeast prior to trans- 
formation of a production relevant host, such as, for example, filamentous fungi with the 
potential variant polynucleotide sequences of interest. 

Often however, a polynucleotide sequence identified by screening in yeast or bacteria 
cannot be expressed or is expressed at low levels when transformed into production relevant 
filamentous fungal cells. This may be due any number of reasons, including differences in 
codon usage, regulation of mRNA levels, translocation apparatus, post-translational modifi- 
cation machinery (e.g., cysteine bridges, glycosylation and acylation patterns), etc. 

Secondly, whether a polynucleotide sequence of interest would be expressed in the pro- 
duction host at commercially useful levels is not necessarily predictable. For example, if 
the organism used in screening the library is, e.g., a bacterial or yeast cell and the produc- 
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tion relevant host cell is a filamentous fungal cell, the protease profiles differ. Thus, a se- 
quence encoding one or more characteristics of interest which has been identified in yeast 
may be degraded by proteases expressed in the product relevant filamentous fungal host 
cell. Furthermore, to obtain optimized yields of the expressed product by altering the func- 
5 tion of regulatory proteins or regulatory sequences requires direct manipulation of the pro- 
duction host. 

A. Aleksenko and AJ. Clutterbuck (1997. Fungal Genetics and Biology 21:373-387) 
disclose the use of autonomous replicative vectors, or autonomously replicating sequences 
(ARS), for gene cloning and expression studies. AMA1 (autonomous maintenance in As- 
10 pergillus) is one of the plasmid replicator elements discussed. It consists of two inverted 
copies of a genomic repeat designated MATE1 (mobile Aspergillus transformation enhan- 
cer) separated by a 0.3 kb central spacer. AMA1 promotes plasmid replication without re- 
arrangement, multimerization or chromosomal integration. 

15 

Summary of the Invention 

It has been found that AMA1 -based plasmids provide two advantages in gene cloning in 
filamentous fungi. The first is a high frequency of transformation which both increases the 

20 potential library size and can eliminate the need for library amplification in an intermediate 
host, e.g., E. coli, so that a recipient Aspergillus strain can be transformed directly with a 
ligation mixture. Secondly, by providing a stable and standard environment for gene ex- 
pression, the properties of the transformants will be uniform. 

It is an objective of the present invention to provide improved methods for constructing 

25 and screening libraries of polynucleotide sequences of interest in filamentous fungal cells by 
use of an episomal replicating DNA vector to provide a high frequency of transformation 
and a uniformly high level of gene expression among independently transformed cells. By 
minimizing variation in copy number among independently transformed cells, a variant 
polypeptide of interest may be identified directly on the basis of expression of the charac- 

30 teristic(s) of interest. 
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Accordingly, in a first aspect of the present invention relates to a method of 
constructing and selecting or screening a library of polynucleotide sequences of interest in 
filamentous fungal cells wherein the method comprises: 

(a) transforming the fungal cells with a population of DNA vectors, wherein each 
vector comprises: 

(i) a polynucleotide sequence encoding a fungal selective marker and a fungal 
replication initiating sequence, wherein the marker and the replication initiat- 
ing sequence do not vary within the population; and 

(ii) a polynucleotide sequence of interest wherein the population of DNA vectors 
contains more than one variant of the polynucleotide sequence; 

(b) cultivating the cells under selective pressure; 

(c) selecting or screening for one or more transformants expressing a desired charac- 
teristic; and 

(d) isolating the transformant(s) of interest. 

In other aspects, the invention relates to the use of a fungal replication initiating 
sequence to construct a library of polynucleotide sequences of interest and to screen or 
select a library of such polynucleotide sequences. 

Brief Description of the Figures 

Figure 1 shows a restriction map of the plasmid pENI1298, the construction of which is 
described in Example 1; 

Figure 2 a restriction map of the plasmid pENI1299, the construction of which is 
described in Example 1; 

Figure 3 a restriction map of plasmids pDM156.2, pJRoy47 and pDM222. A described 
in Example 5; and 

Figure 4 a restriction map of plasmid pJRoy44 described in Example 5. 

List of SEQ ID Nos: 

SEQIDNO:l The fungal replication initiating sequence from A. nidulans (AMA1 
sequence) 
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SEQ ID NO:2 truncated AMA1 

Detailed Description of the Invention 

5 In a first embodiment, the present invention relates to a method of constructing and se- 

lecting or screening a library of polynucleotide sequences of interest in filamentous fungal 
cells, wherein the method comprises: 

(a) transforming the fungal cells with a population of DNA vectors, wherein each 
vector comprises: (i) a polynucleotide sequence encoding a fungal selective marker 

10 and a fungal replication initiating sequence, wherein the marker and the replication 

initiating sequence do not vary within the population; and (ii) a polynucleotide se- 
quence of interest, wherein the population of DNA vectors contains more than one 
variant of the polynucleotide sequence; 

(b) cultivating the cells under selective pressure; 

15 (c) selecting or screening for one or more transformants expressing a desired charac- 

teristic and 
(d) isolating the transformant(s) of interest. 

The term, **a library of polynucleotide sequences of interest" denotes a collection of 
polynucleotide sequences which encode or have the same function or activity of interest. 

20 The library may be "a library of variants of a polynucleotide sequence of interest" which is 
defined herein as a collection of variants wherein the variants differ from a parent polynu- 
cleotide sequence(s) by comprising one or more modifications of said parent polynucleotide 
sequence. Conveniently, the polynucleotide sequences or variants are generated from at 
least one parent polynucleotide sequence of interest by mutagenesis, preferably random 

25 mutagenesis, resulting in a minimum number of 4 and preferably a minimum of 25 differ- 
ent polynucleotide sequences in the collection. Various methods useful for mutagenizing a 
parent polynucleotide sequence are described in the sections further below entitled "Random 
mutagenesis" and "Localized random mutagenesis". The term "modification(s)" is intended 
to indicate a substitution, insertion and/or deletion of one or more nucleotides in the se- 

30 quence as compared to that of the variant and may include a hybrid of two or more different 
parental polynucleotides. 
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The polynucleotide sequences or variants may also result from naturally occurring alle- 
lic variation of a parent polynucleotide sequence. An allelic variant denotes any of two or 
more alternative forms of a gene occupying the same chromosomal locus. Allelic variation 
arises naturally through mutation, and may result in phenotypic polymorphism within 
5 populations. Gene mutations can be silent (i.e., no change in the encoded polypeptide) or 
may encode polypeptides having altered amino acid sequences. Furthermore, rather than 
being generated from one or more parent nucleotide sequences the polynucleotide sequences 
of the library may be derived from different sources and encode different, but highly related 
polypeptides having the same activity or function. In the present context, "highly related" is 

10 intended to indicate that the polypeptides have the same activity or function and in addition 
are encoded by a polynucleotide sequences having shared conserved regions as defined in 
the section further below entitled "DNA shuffling". 

The term "derived from" is intended to indicate that the polynucleotide sequence is 
isolated from a specific source, such as a microorganism. The polynucleotide sequence may 

15 be one generated from a non-cultivable organism. The polynucleotide sequence(s) may be 
any polynucleotide sequence having or encoding a biological activity or function of interest 
and comprising a subsequence of at least six nucleotides to allow for production of a variant 
nucleotide sequence. 

The term "same activity or function" as used about the polynucleotide sequences con- 
20 tained in the library is intended to indicate that the polynucleotide sequences encode 
polypeptides having the same or a similar biological activity or function (determined quali- 
tatively) or the polynucleotide sequences themselves have the same (qualitative) activity or 
function. For instance, the polynucleotide sequences may have promoter activity, ie be ca- 
pable of promoting transcription, or may encode polypeptides with the same qualitative ac- 
25 tivity or function such as the same enzymatic activity or function, eg protease, lipase or 
amylase activity. 

The term "DNA vector" is a polynucleotide sequence, which has the ability to repli- 
cate autonomously and which is able to contain a sequence of interest. Thus a DNA vector 
contains a replication initiation sequence, such as an origin (e.g. ColEl origin, Fl origin, 
30 M13 origin, 2\i origin (yeast), oriP (EB virus) (pi). The DNA vector may e.g. be a plas- 
mid. 
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The term "wherein the population of DNA vectors contains more than one variant of 
the polynucleotide sequence" is intended to indicate that the population of DNA vectors 
contain different polynucleotide sequences encoding or having a biological function or ac- 
tivity of interest. 

5 In a preferred embodiment the polynucleotide sequences of interest are polypeptide 

encoding sequences. The term "polypeptide" encompasses peptides, oligopeptides, and pro- 
teins and, therefore, is not limited to a specific length of the encoded product. The polypeptide 
may be native to the host cell or may be a heterologous polypeptide. The term "heterologous 
polypeptide" is defined as a polypeptide which is not native to the host cell. The polypeptide 

10 may also be a recombinant polypeptide which is a polypeptide native to a cell, which is en- 
coded by a nucleic acid sequence which comprises one or more control sequences, foreign to 
the nucleic acid sequence, which are involved in the production of the polypeptide. The nu- 
cleic acid sequence encoding the polypeptide may have been manipulated in some manner as 
described infra. The polypeptide may be a wild-type polypeptide, or a variant thereof, sub- 

15 jected to the method of the present invention. The polypeptide may also be a hybrid polypep- 
tide which contains a combination of partial or complete polypeptide sequences obtained from 
at least two different polypeptides where one or more of the polypeptides may be heterologous 
to the cell. Polypeptides further include naturally occurring allelic and engineered variations of 
the above mentioned polypeptides. 

20 In another preferred embodiment the polynucleotide sequence of interest is a control 

sequence normally associated with a polypeptide encoding sequence. Control sequences in- 
clude all components which are operably linked to the nucleic acid sequence encoding the 
polypeptide sequence or otherwise involved in the production of the polypeptide. Such control 
sequences include, but are not limited to, a promoter, a signal sequence, a propeptide se- 

25 quence, a transcription terminator, a leader, a promoter recognition sequence, an enhancer se- 
quence and a polyadenylation sequence as described herein. In a still further embodiment the 
polynucleotide sequence of interest is a combination of a polypeptide encoding sequence (or a 
part of such sequence) and a) a control sequence (or part of such control sequence) or b) two 
or more control sequences (or parts of such sequences). Each of the control sequences may be 

30 native or foreign to the coding sequence. 
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In a preferred embodiment, the polypeptide encoded by a polynucleotide sequence of in- 
terest is an antibody or portions thereof, an antigen, a clotting factor, an enzyme, a hormone or 
a hormone variant, a receptor or portions thereof, a regulatory protein, a structural protein, a 
reporter, or a transport protein. 
5 In a more preferred embodiment, the enzyme is an oxidoreductase, transferase, hydrolase, 

lyase, isomerase, or ligase. 

In an even more preferred embodiment, the enzyme is an aminopeptidase, amylase, carbo- 
hydrase, carboxypeptidase, catalase, cellulase, chitinase, cutinase, deoxyribonuclease, dex- 
tranase, esterase, alpha-galactosidase, beta-galactosidase, glucoamylase, alpha-glucosidase, 
10 beta-glucosidase, haloperoxidase, invertase, laccase, lipase, mannosidase, mutanase, oxidase, 
pectinolytic enzyme, peroxidase, phytase, polyphenoloxidase, proteolytic enzyme, ribonucle- 
ase, transglutaminase, or xylanase. A specific example of a lipase is a lipase derived from 
Thermomyces lanuginosa or a variant of said lipase, eg a variant in which an N-terminal exten- 
sion has been added to the mature lipase enzyme as disclosed in WO 97/04079. 
15 In another embodiment, the polypeptide is human insulin or an analog thereof, human 

growth hormone, erythropoietin, or insulinotropin. 

In a further preferred embodiment the control sequence is a promoter sequence, preferably 
a fungal promoter, such as a promoter derived from the gene encoding Aspergillus oryzae 
TAKA amylase, NA2-tpi (a hybrid of the promoters from the genes encoding A. niger neu- 
20 tral ot-amylase and A. oryzae triose phosphate isomerase) and Aspergillus niger or Asper- 
gillus awamori glucoamylase. 

In another preferred embodiment the control sequence is a promoter recognition sequence 
such as the amyR recognition sequence (WO 98/01470), the creA (WO 94/13820) and areA 
(WO 95/35385). 

25 Examples of further control sequences of interest in connection with the present inven- 

tion is listed further below in the section entitled "DNA vectors and control sequences". 

Filamentous fungal selective marker 

The term "selective pressure" is defined herein as culturing a filamentous fungal cell, 
30 containing a DNA vector containing a fungal selective marker gene and a polynucleotide 
sequence of interest, in the presence of an effective amount or the absence of an appropriate 
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selective agent. The effective amount of the selective agent is defined herein as an amount 
sufficient for allowing the selection of cells containing the selection marker from cells 
which do not contain the selection marker. 

In a preferred embodiment, the fungal selective marker is selected from the group of 
genes which encode a product capable of providing resistance to biocide or viral toxicity, 
resistance to heavy metal toxicity, or prototrophy to auxotrophs. 

In a more preferred embodiment, the prototrophy is obtained from an enzyme selected 
from the group of metabolic pathways consisting of nucleotide synthesis, cofactor synthesis, 
amino acid synthesis, acetamide metabolism, proline metabolism, sulfate metabolism, and 
nitrate metabolism. 

In an even more preferred embodiment, the fungal selective marker is a gene selected 
from the group consisting of argB (ornithine carbamoyl transferase), amdS (acetamidase), 
bar (phosphinothricin acetyltransferase), hemA (5-aminolevulinate synthase), hemB 
(porphobilinogen synthase), hygB (hygromycin phosphotransferase), niaD (nitrate reduc- 
tase), prn (proline permease), pyrG (orotidine-5' -phosphate decarboxylase), pyroA y riboB> 
sC (sulfate adenyltransferase), and trpC (anthranilate synthase). 

The fungal cell is cultivated in a suitable medium and under suitable conditions for 
screening or selecting for transformants harbouring the variant polynucleotide sequence of 
interest having or encoding the desired characteristic. The cultivation may be performed in 
accordance with methods well-known in the art for screening of polynucleotide sequence 
libraries. 

Replication initiating sequences 

As used herein, the term "fungal replication initiating sequence" is defined as a 
nucleic acid sequence which is capable of supporting autonomous replication of an ex- 
trachromosomal molecule, e.g., a DNA vector such as a plasmid, in a fungal host cell, 
normally without structural rearrangement of the DNA-vector or integration into the host 
cell genome. The replication initiating sequence may be of any origin as long as it is capa- 
ble of mediating replication intiating activity in a fungal cell. For instance the replication 
initiating sequence may be a telomer of human origin which confer to the plasmid the 
ability to replicate in Aspergillus (Aleksenko and Ivanova, Mol.Gen. Genet. 260 (1998) 
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159-164). Preferably, the replication initiating sequence is obtained from a filamentous fun- 
gal cell, more preferably a strain of Aspergillus, Fusariwn or Alternaria, and even more 
preferably, a strain of A. nidulans, A. oryzae, A. niger, F. oxysporum or Alternaria alte- 
nata. 

5 A fungal replication initiating sequence may be identified by methods well-known 

in the art. For instance, the sequence may be identified among genomic fragments derived 
from the organism in question as a sequence capable of sustaining autonomous replication in 
yeast, (Ballance and Turner, Gene, 36 (1985), 321-331), an indication of a capability of 
autonomous replication in filamentous fungal cells. The replication initiating activity in 
10 fungi of a given sequence may also be determined by transforming fungi with contemplated 
plasmid replicators and selecting for colonies having an irregular morphology, indicating 
loss of a sectorial plasmid which in turn would lead to lack of growth on selective medium 
when selecting for a gene found on the plasmid (Gems et al, Gene, 98 (1991) 61-67). 
AMA1 was isolated in this way. An alternative way to isolate a replication initiating se- 
15 quence is to isolate natural occurring plasmids (eg as disclosed by Tsuge et al., Genetics 
146 (1997) 111-120 for Alternaria aternata). 

Examples of fungal replication initiating sequences include, but are not limited to, 
the ANSI and AMA1 sequences of Aspergillus nidulans y e.g., as described, respectively, 
by Cullen, D., et al. (1987, Nucleic Acids Res. 15:9163-9175) and Gems, D., et al. (1991, 
20 Gene 98:61-67). 

The term "replication initiating activity n is used herein in its conventional mean- 
ing, ie to indicate that the sequence is capable of supporting autonomous replication of an 
extrachromosomal molecule, such as a plasmid or a DNA vector in a fungal cell. 

The term "without structural rearrangement of the plasmid" is used herein to mean that 
25 no part of the plasmid is deleted or inserted into another part of the plasmid, nor is any host 
genomic DNA inserted into the plasmid. 

Preferably, the replication initiating sequence to be used in the methods of the present 
invention is a nucleic acid sequence selected from the group consisting of: 

(a) a nucleotide sequence having at least 50% identity with the nucleic acid sequence of 
30 SEQ ID NO: 1 or SEQ ID NO:2, and is capable of initiating replication in a fungal cell; 
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(b) a nucleotide sequence capable of initiating replication which hybridises under low 
stringency conditions with (i) the nucleic acid sequence of SEQ ID NO: 1 or SEQ ID NO: 2, or 
(ii) the respective complementary strands, wherein the low stringency conditions are defined by 
prehybridisation and hybridisation at 42°C in 5x SSPE, 0.3% SDS, 200 ng/ml sheared and de- 
natured salmon sperm DNA, and 25% formamide, and wash conditions are defined at 50°C for 
30 minutes in 2x SSC, 0.2% SDS; and 

(c) a subsequence of (a) or (b), wherein the subsequence is capable of initiating replica- 
tion in a fungal cell. 

In a preferred embodiment, the nucleotide sequence has a degree of identity to the 
nucleic acid sequence shown in SEQ ID NO:l or SEQ ID NO:2 of at least about 50%, 
more preferably about 60%, even more preferably about 70%, even more preferably about 
80%, even more preferably about 90%, and most preferably about 97% identity (hereinafter 
"homologous polynucleotide**). The homologous polynucleotide also encompasses a 
subsequence of SEQ ID NO: 1 or SEQ ID NO:2 which have replication initiating activity in 
fungal cells. For purposes of the present invention, the degree of identity may be suitably 
determined by means of computer programs known in the art, such as GAP provided in the 
GCG program package (Program Manual for the Wisconsin Package, Version 8, August 1994, 
Genetics Computer Group, 575 Science Drive, Madison, Wisconsin, USA 5371 1) (Needleman, 
S.B. and Wunsch, CD., (1970), Journal of Molecular Biology, 48, 443-45), using GAP with 
the following settings for polynucleotide sequence comparison: GAP creation penalty of 5.0 
and GAP extension penalty of 0.3. 

Hybridisation indicates that by methods of standard Southern blotting procedures, the 
replication initiating sequence hybridises to an oligonucleotide probe derived from the 
nucleic acid sequence shown in SEQ ID NO:l or SEQ ID NO:2, under low to high 
stringency conditions (i.e., prehybridisation and hybridisation at 42°C in 5x SSPE, 0.3% 
SDS, 200 |ig/mi sheared and denatured salmon sperm DNA, and either 25, 35 or 50% 
formamide for low, medium and high stringencies, respectively). In order to identify a 
clone or DNA which is homologous with SEQ ID NO: 1 or SEQ ID NO: 2, the hybridisation 
reaction is washed three times for 30 minutes each using 2X SSC, 0.2% SDS preferably at 
least 50°C, more preferably at least 55 °C, more preferably at least 60° C, more preferably 
at least 65°C, even more preferably at least 70°C, and most preferably at least 75°C. 
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The oligonucleotide probe may be the nucleic acid sequence of SEQ ID NO: 1 or SEQ 
ID NO:2, or a respective subsequence thereof. More specifically, the oligonucleotide probe 
can be considerably shorter than the entire sequence, but should be at least 15, preferably at 
least 25, and more preferably at least 40 nucleotides in length. Both DNA and RNA probes 
5 can be used. 

The probes are typically labelled for detecting the corresponding gene (for example, 
with 32 P, 3 H, 35 S, biotin, or avidin). For example, molecules to which a 32 P-, 3 H- or 35 S- 
labelled oligonucleotide probe hybridises may be detected by use of X-ray film. 

When isolating a replication initiating sequence for use in the present invention, a 

10 genomic DNA, cDNA or combinatorial chemical library prepared from such an organism 
as defined above expected to harbour the sequence is screened for DNA which hybridises 
with the oligonucleotide probe described above which has replication initating activity. 
Genomic or other DNA from such other organisms may be separated by agarose or 
polyacrylamide gel electrophoresis, or other separation techniques. DNA from the libraries 

15 or the separated DNA may be transferred to and immobilised on nitrocellulose or other 
suitable carrier material. A clone or DNA which is homologous to SEQ ID NO: 1 or SEQ 
ID NO:2 may then be identified following standard Southern blotting procedures. 

The techniques used to isolate or clone a nucleic acid sequence having replication initiating 
activity are known in the art and include isolation from genomic DNA or cDNA. The 

20 cloning from such DNA can be effected, e.g., by using methods based on polymerase chain 
reaction (PCR) to detect cloned DNA fragments with shared structural features. (See, e.g., 
Innis, et ah, 1990, PCR: A Guide to Methods and Application, Academic Press, New 
York.) Other nucleic acid amplification procedures such as ligase chain reaction (LCR) 
may be used. 

25 In preferred embodiment, the replication initiating sequence has the nucleic acid 

sequence set forth in SEQ ID NO:l or SEQ ID NO:2, or a respective functional 
subsequence thereof. For instance, a functional subsequence of SEQ ID NO: 1 is a nucleic 
acid sequence encompassed by SEQ ID NO:l or SEQ ID NO 2 except that one or more 
nucleotides from the 5 1 and/or 3' end have been deleted. Preferably, a subsequence contains at 

30 least 100 nucleotides, more preferably at least 1000 nucleotides, and most preferably at least 
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2000 nucleotides. In a more preferred embodiment, a subsequence of SEQ ID NO:l contains 
at least the nucleic acid sequence shown in SEQ ID NO: 2. 

PNA yectpr w$ control sequences 
5 In the DNA vector comprising a polynucleotide sequence encoding a fungal selective 

marker, a fungal replication initiating sequence and a polynucleotide sequence of interest 
the polynucleotide sequence may encode a polypeptide in which case it is operably linked to 
one or more control sequences which direct the expression of the coding sequence. Alterna- 
tively, the polynucleotide sequence is a control sequence in which case, depending on the 

10 control sequence in question, it is normally operably linked to a polypeptide encoding se- 
quence in order to be able to assess the activity of the control sequence (and thus be able to 
select variants of the parent control sequence having desired properties). 

The procedures used to ligate the elements of the DNA vector are well known to one skilled 
in the art (see, e.g. , Sambrook et al. , 1989, supra). 

15 The term "operably linked" is defined herein as a configuration in which a control se- 

quence is appropriately placed at a position relative to the polypeptide encoding sequence such 
that the control sequence directs the expression of the polypeptide or is otherwise involved in 
the production of the polypeptide. 

In the following different control sequences are discussed in further detail. The control 

20 sequences are those which are operably linked to the polynucleotide sequences of interest 
(when the polynucleotide sequences of interest encodes a polypeptide), and those which con- 
stitute the polynucleotide sequences of interest (when the polynucleotide sequence is a control 
sequence). It will be understood that one and the same control sequence may be used either as 
a polynucleotide sequence of interest (when the library is a library of control sequences) or as 

25 a control sequence involved in the production of a polypeptide encoded by a polynucleotide 
sequence of interest (when the library is a library of polynucleotide sequence of interest en- 
coding a polypeptide) and the following disclosure is intended to cover both types of use of 
the control sequence. 

The control sequence may be an appropriate promoter sequence, a nucleic acid sequence 
30 which is recognized by a host cell for expression of a polypeptide encoding sequence. The 
promoter sequence contains transcriptional control sequences which mediate the expression of 
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the polypeptide. The promoter may be any nucleic acid sequence which shows transcriptional 
activity in the host cell of choice including mutant, truncated, and hybrid promoters, and may 
be obtained from genes encoding extracellular or intracellular polypeptides either homologous 
or heterologous to the host cell. 

Examples of suitable promoters for directing the transcription of a polypeptide encoding 
nucleotide sequence in a filamentous fungal host cell are promoters obtained from the genes 
encoding Aspergillus oryzae TAKA amylase, Rhizomucor miehei aspartic proteinase, Asper- 
gillus niger neutral alpha-amylase, Aspergillus niger acid stable alpha-amylase, Aspergillus 
niger or Aspergillus awamori glucoamylase (glaA), Rhizomucor miehei lipase, Aspergillus 
oryzae alkaline protease, Aspergillus oryzae triose phosphate isomerase, Aspergillus nidulans 
acetamidase, Fusarium oxysporum trypsin-like protease (U.S. Patent No. 4,288,627), and 
mutant, truncated, and hybrid promoters thereof. Particularly preferred promoters for use in 
filamentous fungal host cells are the TAKA amylase, NA2-tpi (a hybrid of the promoters from 
the genes encoding Aspergillus niger neutral alpha-amylase and Aspergillus oryzae triose 
phosphate isomerase), and glaA promoters. 

The control sequence may also be a suitable transcription terminator sequence, a sequence 
recognized by a filamentous fungal cell to terminate transcription. The terminator sequence is 
operably linked to the 3' terminus of the polypeptide encoding nucleotide sequence. Any ter- 
minator which is functional in the filamentous fungal cell may be used in the present invention. 

Preferred terminators are obtained from the genes encoding Aspergillus oryzae TAKA 
amylase, Aspergillus niger glucoamylase, Aspergillus nidulans anthranilate synthase, Asper- 
gillus niger alpha-glucosidase, and Fusarium oxysporum trypsin-like protease. 

The control sequence may also be a suitable leader sequence, a nontranslated region of a 
mRNA which is important for translation by the filamentous fungal cell. The leader sequence 
is operably linked to the 5' terminus of the polypeptide encoding nucleotide sequence. Any 
leader sequence which is functional in the filamentous fungal cell may be used in the present 
invention. 

Preferred leaders are obtained from the genes encoding Aspergillus oryzae TAKA amylase 
and Aspergillus nidulans triose phosphate isomerase. 

The control sequence may also be a polyadenylation sequence, a sequence which is opera- 
bly linked to the 3' terminus of the polypeptide encoding nucleotide sequence which, when 
transcribed, is recognized by a filamentous fungal cell as a signal to add polyadenosine residues 
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to transcribed mRNA. Any polyadenylation sequence which is functional in the filamentous 
fungal cell may be used in the present invention. 

Preferred polyadenylation sequences are obtained from the genes encoding Aspergillus 
oryzae TAKA amylase, Aspergillus niger glucoamylase, Aspergillus nidulans ant hrani late 
synthase, and Aspergillus niger alpha-glucosidase. 

The control sequence may also be a signal peptide coding region, which codes for an 
amino acid sequence linked to the amino terminus of the polypeptide which can direct the en- 
coded polypeptide into the cell's secretory pathway. The 5* end of the polypeptide encoding 
nucleotide sequence may inherently contain a signal peptide coding region naturally linked in 
translation reading frame with the segment of the coding region which encodes the secreted 
polypeptide. Alternatively, the 5' end of the coding sequence may contain a signal peptide 
coding region which is foreign to the coding sequence. The foreign signal peptide coding re- 
gion may be required where the coding sequence does not normally contain a signal peptide 
coding region. Alternatively, the foreign signal peptide coding region may simply replace the 
natural signal peptide coding region in order to obtain enhanced secretion of the polypeptide. 
The signal peptide coding region may be obtained from a glucoamylase or an amylase gene 
from an Aspergillus species, or a lipase or proteinase gene from a Rhizomucor species. How- 
ever, any signal peptide coding region which directs the expressed polypeptide into the secre- 
tory pathway of a filamentous fungal cell may be used in the present invention. 

An effective signal peptide coding region is the signal peptide coding region obtained from 
the Aspergillus oryzae TAKA amylase gene, Aspergillus niger neutral amylase gene, Rhi- 
zomucor miehei aspartic proteinase gene, or Humicota lanuginosa cellulase gene. 

The control sequence may also be a propeptide coding region, which codes for an amino 
acid sequence positioned at the amino terminus of a polypeptide. The resultant polypeptide is 
known as a proenzyme or propolypeptide (or a zymogen in some cases). A propolypeptide is 
generally inactive and can be converted to a mature, active polypeptide by catalytic or auto- 
catalytic cleavage of the propeptide from the propolypeptide. The propeptide coding region 
may be obtained from the Rhizomucor miehei aspartic proteinase gene, or the Myceliophthora 
thermophila laccase gene (WO 95/33836). 

Where both signal peptide and propeptide regions are present at the amino terminus of a 
polypeptide, the propeptide region is positioned next to the amino terminus of a polypeptide 
and the signal peptide region is positioned next to the amino terminus of the propeptide region. 
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The polypeptide encoding nucleotide sequence may also be operably linked to one or more 
nucleic acid sequences which encode one or more factors that are advantageous for directing 
the expression of the polypeptide, e.g., a transcriptional activator {e.g., a tfraws-acting factor), a 
chaperone, and a processing protease. Any factor that is functional in a filamentous fungal cell 
may be used in the present invention. The nucleic acids encoding one or more of these factors 
are not necessarily in tandem with the polypeptide encoding nucleotide sequence. 

An activator is a protein which activates transcription of a polypeptide encoding nucleo- 
tide sequence (Kudla et al, 1990, EMBO Journal 9: 1355-1364; Jarai and Buxton, 1994, Cwr- 
rent Genetics 26: 2238-244; Verdier, 1990, Yeast 6: 271-297). The nucleic acid sequence en- 
coding an activator may be obtained from the genes encoding Saccharomyces cerevisiae heme 
activator protein 1 (hapJ), Saccharomyces cerevisiae galactose metabolizing protein 4 (gal4), 
Aspergillus nidulans ammonia regulation protein (areA), and Aspergillus oryzae alpha-amylase 
activator (amyK). For further examples, see Verdier, 1990, supra and MacKenzie et ah, 1993, 
Journal of General Microbiology 139: 2295-2307. 

A chaperone is a protein which assists another polypeptide to fold properly (Haiti et al, 
1994, TIBS 19: 20-25; Bergeron etal, 1994, TIBS 19: 124-128; Demolder et al, 1994, Jour- 
nal of Biotechnology 32: 179-189; Craig, 1993, Science 260: 1902-1903; Gething and Sam- 
brook, 1992, Nature 355: 33-45; Puig and Gilbert, 1994, Journal of Biological Chemistry 
269: 7764-7771; Wang and Tsou, 1993, The FASEB Journal 7: 1515-1 1 157; Robinson et al, 
1994, Bio/Technology 1: 381-384; Jacobs etal, 1993, Molecular Microbiology 8: 957-966). 
The nucleic acid sequence encoding a chaperone may be obtained from the genes encoding As- 
pergillus oryzae protein disulphide isomerase or Saccharomyces cerevisiae calnexin, Sac- 
charomyces cerevisiae BiP/GRP78, and Saccharomyces cerevisiae Hsp70. For further exam- 
ples, see Gething and Sambrook, 1992, supra, and Hartl et al, 1994, supra. 

A processing protease is a protease that cleaves a propeptide to generate a mature bio- 
chemically active polypeptide (Enderlin and Ogrydziak, 1994, Yeast 10: 67-79; Fuller et al, 
1989, Proceedings of the National Academy of Sciences USA 86: 1434-1438; Julius et al, 
1984, Cell 37: 1075-1089; Julius et al, 1983, Cell 32: 839-852; U.S. Patent No. 5,702,934). 
The nucleic acid sequence encoding a processing protease may be obtained from the genes en- 
coding Saccharomyces cerevisiae dipeptidylaminopeptidase, Saccharomyces cerevisiae Kex2, 
Yarrowia lipolytica dibasic processing endoprotease (xpr6), and Fusarium oxysporum metal- 
loprotease (p45 gene). 
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It may also be desirable to add regulatory sequences which allow the regulation of the ex- 
pression of the polypeptide relative to the growth of the filamentous fungal cell. Examples of 
regulatory systems are those which cause the expression of the gene to be turned on or off in 
response to a chemical or physical stimulus, including the presence of a regulatory compound. 
The TAKA alpha-amylase promoter, Aspergillus niger glucoamylase promoter, and the Asper- 
gillus oryzae glucoamylase promoter may be used as regulatory sequences. Other examples of 
regulatory sequences are those which allow for gene amplification, e.g., the metallothionein 
genes which are amplified with heavy metals. In these cases, the polypeptide encoding nucleo- 
tide sequence would be operably linked with the regulatory sequence. 

The introduction of the DNA vector into a filamentous fungal cell may involve a process 
consisting of protoplast formation, transformation of the protoplasts, and regeneration of the 
cell wall in a manner known per se. Suitable procedures for transformation of Aspergillus host 
cells are described in EP 238 023 and Yelton et ah, 1984, Proceedings of the National Acad- 
emy of Sciences USA 81: 1470-1474. A suitable method of transforming Fusarium species is 
described by Malardier et al, 1989, Gene 78: 147-156 or in WO 96/00787. 

The fungal cells to be transformed with the population of DNA vectors are filamentous 
fungal cells. 

"Filamentous fungi" include all filamentous forms of the subdivision Eumycota and Oomy- 
cota (as defined by Hawksworth et ah, 1995, supra). The filamentous fungi are characterized 
by a mycelial wall composed of chitin, cellulose, glucan, chitosan, mannan, and other complex 
polysaccharides. Vegetative growth is by hyphal elongation and carbon catabolism is obliga- 
tely aerobic. In contrast, vegetative growth by yeasts such as Saccharomyces cerevisiae is by 
budding of a unicellular thallus and carbon catabolism may be fermentative. In a preferred em- 
bodiment, the filamentous fungal cell is a cell of a species of, but is not limited to, Acremo- 
nium, Aspergillus, Fusarium, Humicola, Mucor, Myceliophthora, Neurospora, Penicillium, 
Scytalidium, Thielavia, Tolypocladium, and Trichoderma. 

The filamentous fungal cell to be used in the present invention is normally chosen on the 
basis of the polynucleotide sequence of interest. For instance, if the polynucleotide of interest 
is a control sequence intended for use in an Aspergillus cell the filamentous fungal cell is nor- 
mally an Aspergillus cell. Examples of filamentous fungal cells of use in the present invention 
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include an Aspergillus cell, an Acremonium cell, a Fusahum cell, a Hwnicola cell, a Mucor 
cell, a Myceliophthora cell, a Neurospora cell, a Penicillium cell, a Thielavia cell, a Tolypo- 
cladium cell, and a Trichoderma cell. 

More specifically, the filamentous fungal cell is an Aspergillus awamori, Aspergillus foeti- 
dus, Aspergillus japonicus^ Aspergillus nidulans, Aspergillus niger y or Aspergillus oryzae cell; 
a Fusarium bactridioides, Fusarium cerealis, Fusarium crookwellense y Fusarium culmorum, 
Fusarium graminearum, Fusarium graminum, Fusarium heterosporum y Fusarium negundi, 
Fusarium oxysporum, Fusarium reticulatum y Fusarium roseum, Fusarium sambucinum, 
Fusarium sarcochroum, Fusarium sporotricioides, Fusarium sulphur eum y Fusarium torulo- 
sum, Fusarium trichothecioides, Fusarium venenatum cell or a Fusarium venenatum cell 
(Nirenberg sp. nov; a Humicola insolens cell or a Humicola lanuginosa cell; a Mucor miehei 
cell; a Myceliophthora ihermophila cell; a Neurospora crassa cell; a Penicillium purpuro- 
genum cell; a Thielavia terrestris cell; or a Trichoderma harzianum, Trichoderma koningii, 
Trichoderma longibrachiatum y Trichoderma reesei, or Trichoderma viride cell. 

Selecting or screening the library for transformants of interest 

It will be understood that the method to be used for performing the selection or 
screening step c) of the method of the invention will depend on the polynucleotide sequence 
of interest in question. The term "selecting or screening" for one or more transformants 
expressing a desired characteristic" as used in step c) is intended to indicate that the 
screening or selecting is performed so as to identify transformants containing a modified 
polynucleotide sequence of interest (which has been generated on the basis of the 
polynucleotide sequences of interest during the cultivation step b) which has the desired 
activity or function and optionally further desired characteristics as exemplified below. 
Thus, if the polynucleotide sequence of interest encodes a polypeptide with a certain 
activity or function the selection will only allow the transformants expressing a polypeptide 
with the desired activity or function, to grow. Thus, if the polynucleotide sequence of 
interest encodes a polypeptide with a certain activity or function the screening will be 
performed to identify transformants expressing a polypeptide with the desired activity or 
function. For instance, if the polynucleotide sequence of interest encodes an enzyme, such 
as a lipase, the selection or screening step c) will be performed to identify transformants 
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expressing lipase activity. If it is desired that the lipase to be identified as a specific 
characteristic, such as a high thermostability, the screening is to be performed under 
conditions (typically temperatures) at which lipases with the desired high thermostability 
can be identified. 

Analogously, if the polynucleotide sequence of interest is a control sequence such as a 
promoter sequence the selection or screening step c) is performed under condition in which 
promoter activity can be assessed. Typically, in the library the promoter polynucleotide 
sequences of interest are operably linked to a second sequence to be transcribed (eg a 
polypeptide encoding sequence) so that the promoter activity can be assayed with reference 
to the transcription of said second sequence. 

Library construction in bacterial o r yeast hosts 

The present invention also relates to methods of constructing and screening or selecting 
a library of polynucleotide sequences of interest in a filamentous fungal cell, comprising: 

(a) transforming a culture of bacterial or yeast cells with a population of DNA vectors, 
wherein each vector comprises 

(i) a polynucleotide sequence encoding a filamentous fungal selective marker, a fila- 
mentous fungal replication initiating sequence, a bacterial or yeast selective marker 
and a bacterial or yeast replication initiating sequence, respectively, none of which 
vary substantially within the population of DNA vectors, and 

(ii) a polynucleotide sequence of interest, wherein more than one variant of the polynu- 
cleotide sequence is contained within the population of DNA vectors; 

(b) cultivating the bacterial or yeast cells under selective pressure; 

(c) isolating the DNA vectors from the transformants of (b); 

(d) transforming filamentous fungal cells with the DNA vectors of (c); 

(e) cultivating the filamentous fungal cells of (d) under selective pressure; 

(f) selecting or screening for one or more filamentous fungal transformants expressing a 
desired characteristic; and 

(g) isolating the filamentous fungal transforrnant(s) of interest. 

The advantage of using yeast or bacteria as intermediate hosts is that the transformation 
frequency is 100-1000 fold higher than for filamentous fungal cells such as Aspergillus, re- 
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suiting in a larger library when first transforming the manipulated and optionally ligated 
DNA vectors into yeast or bacteria, and then transforming the isolated supercoiled vectors 
into the filamentous fungal cell. 

In a preferred embodiment, the library of polynucleotide sequences of interest is pre- 
pared by random mutagenesis or naturally occurring allelic variants of at least one parent 
polynucleotide sequence having or encoding a biological function of interest as described 
above. 

In another preferred embodiment, the bacterial cell is a strain of E. coli. 

In another preferred embodiment, the yeast cell is a strain of Saccharomyces sp., in 
particular, strains of Saccharomyces cerevisiae or Saccharomyces kluyveri or 
Schizosaccharomyces sp. Further examples of suitable yeast cells are strains of Kluyveromyces, 
such as K. lac t is, Hansemda, e.g. H. potymorpha, or Pichia, e.g. P. pastoris. 

According to the method above, the DNA vector contains a selective marker which per- 
mits easy selection of transformed bacterial or yeast cells. Examples of bacterial selective mark- 
ers include, but are not limited to, markers which confer antibiotic resistance, such as ampicillin, 
kanamycin, erythromycin, chloramphenicol or tetracycline resistance. Furthermore, selection 
may be accomplished by co-transformation, e.g. , as described in WO 91/09129, where the se- 
lective marker is on a separate vector. 

Suitable markers for yeast host cells are ADE2, HIS3, LEU2, LYS2, MET3, TRP1, and 
URA3. 

In a preferred embodiment, the bacterial or yeast selective marker provides for resistance 
to a biocide, wherein the biocide agent is selected from the group consisting of ampicillin, 
kanamycin, tetracyclin, chloramphenicol, neomycin, hygromycin and methotrexate. 

In another preferred embodiment, the bacterial or yeast selective marker is selected 
from the group of genes which encode a product which provides for resistance to biocide or 
viral toxicity, resistance to heavy metal toxicity, or prototrophy to auxotrophs. 

In another preferred embodiment, the prototrophy is obtained from an enzyme selected 
from the group of metabolic pathways consisting of nucleotide synthesis, cotactor synthesis, 
amino acid synthesis, acetamide metabolism, proline metabolism, sulfate metabolism, and 
nitrate metabolism. 
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For autonomous replication, the vector may further comprise an origin of replication ena- 
bling the vector to replicate autonomously in the bacterial or yeast cell in question. Examples of 
bacterial origins of replication are the origins of replication of plasmids pBR322, pUC19, pA- 
CYC177, and pACYC184 permitting replication in E. coli. 

Examples of origins of replication for use in a yeast host cell are the 2 micron origin of 
replication, ARS1, ARS4, the combination of ARS1 and CEN3, and the combination of ARS4 
and CEN6. 

The transformation of the bacterial or yeast cell may, for instance, be effected by using 
competent cells, by electroporation, or by conjugation of bacterial cells (see, e.g., J. Sambrook, 
E.F. Fritsch, and T. Maniatus, 1989, Molecular Cloning, A Laboratory Manual, 2d edition, 
Cold Spring Harbor, New York). The cultivation of these cells under selective pressure may be 
conducted in accordance with methods known in the art. 

The filamentous fungal selective marker, fungal replication initiating sequence and the 
polynucleotide sequence of interest is preferably as herein, eg in the sections entitled: 
"Filamentous fungal selective marker" and "Replication initiating sequences". 

The selection or screening of transform ants of interest may be performed as described above 
in the section entitled "Selection and screening of transformants". 

Polynucleotide sequences of interest 

It will be understood that the present invention will be useful for screening or selecting li- 
braries of any type of polynucleotide sequence of interest. More specifically, the polynucleo- 
tide sequences of interest to be used in the methods of the present invention may be gener- 
ated from a parent polynucleotide sequence having or encoding a biological activity or 
function of interest. For instance, the polynucleotide sequences of interest are generated 
from a gene encoding a polypeptide of interest by random mutagenesis of the gene. Alter- 
natively, the polynucleotide sequence of interest is a control sequence as defined above, 
e.g., a promoter sequence. Also, the polynucleotide sequences of interest may be derived 
from different sources, such as different naturally occurring sources such as microorgan- 
isms, as described further above. The polynucleotide sequence of interest may be a com- 
bination of a polypeptide encoding sequence and a control sequence. 
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In a preferred embodiment the modification of a parent polynucleotide sequence is done 
by use of a physical or chemical mutagenizing agent, use of a doped oligonucleotide, DNA 
shuffling, or by subjecting the nucleic acid sequence to PCR generated mutagenesis, or use 
of any combination thereof. 

Alternatively, the polynucleotide sequences of interest may be obtained by subjecting 
the sequence to mutagenesis by misincorporation of nucleotide bases by using an error- 
prone polymerase or a polymerase working under suboptimal conditions in order to promote 
formation of errors, i.e., error-prone PCR. Error-prone DNA synthesis may be carried out 
in vitro or in vivo, such as by use of various mutator strains. 

Random mutagenesis 

A general approach to generate variant polynucleotide sequences encoding modified pro- 
teins and enzymes has been based on random mutagenesis, for instance, as disclosed in US 
4,894,331 and WO 93/01285. This approach may also be used in connection with the present 
invention. For instance, the random mutagenesis may be performed by use of a suitable physical 
or chemical mutagenizing agent, by use of a suitable oligonucleotide, or by subjecting the 
polynucleotide sequence to PCR generated mutagenesis. Furthermore, the random mutagenesis 
may be performed by use of any combination of these mutagenizing agents. The mutagenizing 
agent may, e.g., be one which induces transitions, transversions, inversions, scrambling, dele- 
tions, and/or insertions. 

Examples of a physical or chemical mutagenizing agent suitable for the present purpose in- 
clude ultraviolet (UV) irradiation, hydroxylamine, N-methyl-N'-nitro-N-nitrosoguanidine 
(MNNG), Omethyl hydroxylamine, nitrous acid, ethyl methane sulphonate (EMS), sodium bi- 
sulphite, formic acid, and nucleotide analogues. When such agents are used, the mutagenesis is 
typically performed by incubating the polynucleotide sequence of interest in the presence of the 
mutagenizing agent of choice under suitable conditions for the mutagenesis to take place. 

When the mutagenesis is performed by the use of an oligonucleotide, the oligonucleotide 
may be doped or spiked with the three non-parent nucleotides during the synthesis of the oli- 
gonucleotide at the positions which are to be changed. The doping or spiking may be done so 
that codons for unwanted amino acids are avoided. The doped or spiked oligonucleotide can be 
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incorporated into the polynucleotide of interest by any published technique, using e.g. PCR, 
LCR or any DNA polymerase and ligase as deemed appropriate. 

Preferably, the doping is carried out using "constant random doping" , in which the per- 
centage of wild-type and mutation in each position is predefined. Furthermore, the doping 
may be directed toward a preference for the introduction of certain nucleotides, and thereby 
a preference for the introduction of one or more specific amino acid residues. The doping 
may be made, e.g., so as to allow for the introduction of 90% wild type and 10% mutations 
in each position. An additional consideration in the choice of a doping scheme is based on 
genetic as well as protein-structural constraints. The doping scheme may be made by using 
the DOPE program (cf., Tomandl, D., et al., 1997. Journal of Computer- Aided Molecular 
Design 11:29-38; Jensen, U, Andersen, KV, Svendsen, A., and Kretzschmar, T., 1998. 
Nucleic Acids Research 26:697-702) which, inter alia, ensures that introduction of stop 
codons is avoided. 

When PCR-generated mutagenesis is used, either a chemically treated or non-treated parent 
polynucleotide sequence of interest is subjected to PCR under conditions that increase the mis- 
incorporation of nucleotides (Deshler, J.O., 1992. Genetic Analysis, Techniques and Applica- 
tions 9:103-106; Leung, et al., 1989. Technique 1:11-15. 

A mutator strain of E. coli (Fowler, et al., 1974. Molec. Gen. Genet. 133:179-191), 5. 
cereviseae or any other microbial organism may be used for the random mutagenesis of the par- 
ent polynucleotide sequence of interest by, e.g., transforming a plasmid containing the parent 
polynucleotide sequence into the mutator strain, growing the mutator strain with the plasmid and 
isolating the mutated plasmid from the mutator strain. The mutated plasmid may be subse- 
quently transformed into the expression organism. 

The polynucleotide sequence to be mutagenised may be conveniently present in a genomic 
or cDNA library prepared from an organism harbouring the sequence. Alternatively, the se- 
quence may be present on a suitable vector such as a plasmid or a bacteriophage, which as such 
may be incubated with or otherwise exposed to the mutagenising agent. The polynucleotide se- 
quence to be mutagenised may be in isolated form. It will be understood that the polynucleotide 
sequence to be subjected to random mutagenesis is preferably a cDNA or a genomic DNA se- 
quence. The mutated DNA sequence may further comprise a DNA sequence encoding functions 
permitting expression of the mutated DNA sequence. 
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DNA shuffling 

Alternative methods for rapid preparation of variants of a polynucleotide sequence of inter- 
est in accordance with the present invention include methods of in vivo or in vitro DNA shuf- 
fling, wherein DNA shuffling is defined as recombination, either in vivo or in vitro, of nucleo- 
tide sequence fragments) between two or more homologous polynucleotides resulting in output 
polynucleotides (i.e., polynucleotides which have been subjected to a shuffling cycle) containing 
a number of exchanged nucleotide fragments when compared to the input polynucleotides (i.e., 
the polynucleotides subjected to shuffling). Shuffling may be accomplished either in vitro or in 
vivo by recombination within a cell by methods described in the art, examples of which are 
listed below. 

For instance, H. Weber and Weissmann, C. (1983. Nucleic Acids Research 11:5661-5669) 
describe a method for modifying genes by in vivo recombination between two homologous 
genes, in which recombinants are identified and isolated using a resistance marker. 

Pompon, et al. f (1989, Gene 83:15-24) describe a method for shuffling gene domains of 
mammalian cytochrome P-450 by in vivo recombination of partially homologous sequences in 
Saccharomyces cerevisiae by transforming Saccharomyces cerevisiae with a linearized plasmid 
with filled-in ends, and a DNA fragment being partially homologous to the ends of said plasmid. 

In WO 97/07205 a method is described in which polypeptide variants are prepared by shuf- 
fling different nucleotide sequences of homologous DNA sequences by in vivo recombination 
using plasmid DNA as template. 

US patent no. 5,093,257 (Genencor International, Inc.) discloses a method for producing 
hybrid polypeptides by in vivo recombination. 

In a preferred embodiment, the variant polynucleotide sequences of interest are ob- 
tained by in vivo recombination between two or more homologous nucleic acid sequences of 
interest, comprising: 

(a) identifying at least one conserved region between the polynucleotide sequences of inter- 
est; 

(b) generating fragments of each of the polynucleotide sequences of interest, wherein the 
fragments comprise the conserved region(s) of (a); and 
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(c) recombining the fragments of (b) by using the conserved region(s) as (a) homologous 
linking point(s). 

Preferably, the polynucleotide sequences of interest encode a polypeptide or a part 
thereof or are control sequences as defined above or any combination of both. 
5 The term "conserved region" denotes a subsequence, preferably of at least 10 bp, 

shared by two or more sequences in which there is a degree of identity between the subse- 
quences of at least about 50%, more preferably at least about 60%, even more preferably at 
least about 70%, even more preferably at least about 80%, even more preferably at least 
about 90%, and most preferably at least about 97%. 

10 In order for the conserved region to be used as a "linking point" between two se- 

quences, the degree of identity between the sequences within the conserved region(s), is 
sufficiently high to enable hybridisation, e.g., under conditions described supra, between 
the sequences, whereby the conserved region serves as the linking point. 

One method for shuffling of homologous DNA sequences in vitro has been described by 

15 Stemmer (Stemmer, 1994. Proc. Nad. Acad. Sci. USA, 91:10747-10751; Stemmer, 1994. Na- 
ture 370:389-391; Crameri, A., et al., 1997. Nature Biotechnology 15:436-438). The method 
relates to shuffling homologous DNA sequences by using in vitro PCR techniques. Positive re- 
combinant genes containing shuffled DNA sequences are selected from a DNA library based on 
the improved function of the expressed proteins. 

20 The above method is also described in WO 95/22625 in relation to a method for shuffling 

homologous DNA sequences. An important step in the method is to cleave the homologous 
template double-stranded polynucleotide into random fragments of a desired size followed by 
homologously reassembling the fragments into full-length genes. 

WO 98/41653 discloses a method of DNA shuffling in which a library of recombined ho- 

25 mologous polynucleotides is constructed from a number of different input DNA templates and 
primers by induced template shifts during in vitro DNA synthesis. 

Localized random mutagenesis 

The random mutagenesis may be advantageously localised to a part of a parent polynucleo- 
30 tide sequence in question. The sequence to be modified may be, for example, the coding region 
of a gene, or a part thereof, essential for activity of the gene product, or a control sequence or 
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part thereof as defined above. A preferred example of such control sequence is a promoter 
sequence, or a functional part thereof, i.e., a part which is sufficient for affecting expression of 
the nucleic acid sequence. 

Localised random mutagenesis may, e.g., be advantageous when certain regions of the 
polynucleotide sequence of interest have been identified to be of particular importance. For in- 
stance, when the polynucleotide sequence of interest encodes a polypeptide the important region 
may be one essential for a given property of the polypeptide, the modification of which region is 
expected to result in a variant in which this property has been improved. Such regions may 
normally be identified when the tertiary structure of the parent polypeptide has been elucidated 
and related to its function. Analogously, when the polynucleotide sequence of interest is a con- 
trol sequence, such as a promoter, the region of interest may be one expected to be essential to 
or involved in promoter activity. 

The localised, or region-specific, random mutagenesis is conveniently performed by use of 
PCR generated mutagenesis techniques as described above or any other suitable technique 
known in the art. Alternatively, the nucleotide sequence encoding the part of the polynucleotide 
sequence to be modified may be isolated, e.g., by insertion into a suitable vector, and said part 
may be subsequently subjected to mutagenesis by use of any of the mutagenesis methods dis- 
cussed above. 

Use of a fungal replication initiating sequence 

In a further aspect the invention relates to the use of a fungal replication initiating se- 
quence as defined herein for constructing and selecting or screening a library of polynucleotide 
sequences of interest in filamentous fungal cells. Preferably, the library is constructed by the 
method according to the first or second aspect of the invention. In a preferred embodiment the 
fungal replication initiating sequence is selected from the group consisting of: 
(a) a replication initiating sequence having at least 50% identity with the nucleic acid sequence 
of SEQ ID NO: 1 or SEQ ID NO:2 and is capable of initiating replication; 
(b) a replication initiating sequence which hybridises under low stringency conditions with 
(i) the nucleic acid sequence of SEQ ID NO: 1 or SEQ ID NO:2, or (ii) the respective 
complementary strands, wherein the low stringency conditions are defined by prehy- 
bridization and hybridization at 42°C in 5x SSPE, 0.3% SDS, 200 ng/ml sheared and 
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denatured salmon sperm DNA, and 25% formamide, and wash conditions are defined 

at 50°C for 30 minutes in 2x SSC, 0.2% SDS; and 
(c) a subsequence of (a) or (b), wherein the subsequence has replication initiating activity. 
Preferably, the replication initiating sequence is obtained from a filamentous fungal cell, in 
particular from a strain of Aspergillus, such as A. nidulans. In the most preferred embodi- 
ment the replication initiating sequence has the nucleic acid sequence set forth in SEQ ID 
NO:l or SEQ ID NO: 2, or is a respective functional subsequence thereof. The replication 
intiating sequence to be used in accordance with this aspect may be as described in the sec- 
tion above entitled "Replication initiating sequences". 

Library of polynucleotide sequences of interest 

In a further aspect the invention relates to a library of polynucleotide sequences of inter- 
est which library comprises filamentous fungal cells transformed with a population of DNA 
vectors, wherein each vector comprises: 

(i) a gene encoding a fungal selection marker and a fungal replication initiating sequence 

wherein the marker and the replication initiating sequence do not vary within 
the population; and 

(ii) a polynucleotide sequence of interest wherein the population of DNA vectors contains 
more than one variant of the polynucleotide sequence. 

In a preferred embodiment the vector further comprises a nucleic acid sequence encod- 
ing a bacterial or yeast selective marker and a bacterial or yeast replication initiating se- 
quence. Preferably, the library is prepared by the method according to the first or second 
aspect of the invention. Preferably, the elements of the library, such as the DNA vector and 
its mentioned components are as described herein. 

The present invention is further described by the following examples which should not be 
construed as limiting the scope of the invention. 

Examples 

Materials 
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Chemicals used as buffers and substrates were commercial products of at least reagent 
grade. 



Plasmids 



pMT1505: 
pHelpl: 



pToC68: 

pAHL: 

pS02: 

pENI1127: 

pENI1245: 

pENI1246: 

pENI1298: 

pENI1299: 



constructed as described below in Example 1 

contains the pyrG gene from A. oryzae as a selective marker and the AMA1 

sequences which enable autonomous replication in A. nidulans as described 

by Gems, D., et al. (1991. Gene 98: 61-67) 

as described in EP 0 531 372 (Novo Nordisk A/S) 

as described in WO 92/05249, containing a lipase encoding sequence 

as described in WO 96/29391 , containing the Aspergillus oryzae pyrG gene 

constructed as described below in Example 1 

constructed as described below in Example 1 

constructed as described below in Example 1 

constructed as described below in Example 1 

constructed as described below in Example 1 



Strains 

JaL250: a derivative of Aspergillus oryzae A 1560 in which the pyrG gene has been 

inactivated, as described in WO 98/01470 
DH5a: an E. coli host cell purchased from GIBCO BRL (Life Technologies, Inc., 

Rockville MD) 



DLM15: JVi/TVB: hvor er den beskrevet (i Royer et al nsevnt i Example 5?) 



Example 1: Construction of pENI1298 and pENI1299 



pMT1466 was constructed by inserting an Sphl/Narl fragment from pHelpl into 
pToC68. pMT1489 was constructed by digesting pMT1466 with SphI and StuI, then reli- 
gating, pMT1500 was constructed by digesting pMT1489 with Aatll and Narl and ligating 
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a linker. pMT1504 was constructed by digesting pMT1500 with Nhel and reli gating. 
pMT1505 was constructed by inserting a 2.7 kb Xbal fragment containing the amdS en- 
coding gene from A. nidulans genomic DNA (Corrick, CM., et al. 1987, Gene 53:63-71) 
into pMT1504 which had been cut with Nhel. pENI1127 was constructed from pMT1505 
5 which had been digested with Sail in order to remove one of the AMA1 repeats and the 
central spacer region. 

pENI1127 and pMT1505 were each cut with Hindm, resulting in fragments of 2408 
bp and 5253 bp, respectively, which were purified from a gel of 1% agarose and cloned 
into the Hindm site in the vector pAHL which contains a lipase encoding sequence. The 
10 resulting plasmids were called pENI1245 and pENI1246, respectively. 

A 3527 bp Stul/Bbul fragment containing the pyrG gene was excised from pS02 and 
inserted into a Stul/Bbul site in both pENI1245 and pENI1246. The resulting plasmids 
were called pENI1298 and pENI1299, respectively. The restriction map for pENI1298 is 
shown in Figure 1, and for pENI1299, in Figure 2. 

15 

Example 2: Expression Levels of Lipase in Independently Grown Aspergillus oryzae 
Transformants 

JaL250 was transformed with pENI1298 and pENI1299 using standard procedures, cf., 
20 as described in WO 98/01470. The cells were then cultured on Cove plates at 37 d C. 

Transformants appeared after three days incubation at a transformation frequency of 10 4 
- 10 s /|ig DNA, an increase of 100- to 10,000-fold over the transformation frequency in the 
absence of an AMA1 sequence. 

Thirty independent transformants from the pENI1298 and pENI1299 transformations 
25 were isolated on Cove plates and at the same time inoculated into a 96-well microtiter dish 
containing 200 |jtl minimal media of l*vogel, 2% maltose (e.g., Methods in Enzymology . 
Vol. 17 p. 84) in each well. 

After three days of incubation at 34°C, media from the cultures in the microtiter dish 
were assayed for lipase activity. A 10 |il aliquot of media from each well was added to a 
30 microtiter well containing 200 \il of a lipase substrate of 0.018% p-nitrophenylbutyrate, 
0.1% Triton X-100, 10 mM CaCl 2 , 50 mM Tris pH 7.5. Activity was assayed spectro- 
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photometrically at 15-second intervals over a five minute period, using a kinetic microplate 
reader (Molecular Device Corp,, Sunnyvale CA), using a standard enzymology protocol 
(e.g., Enzyme Kinetics, Paul C.Engel, ed., 1981, Chapman and Hall Ltd.) Briefly, product 
formation is measured during the initial rate of substrate turnover and is defined as the slope 
of the curve calculated from the absorbance at 405 nm every 15 seconds for 5 minutes. The 
arbitrary lipase activity units were normalized against the transformant showing the highest 
lipase activity. For each group of thirty transformants an average value and the standard 
deviation were calculated. 

At the same time, the transformants which had been cultured on Cove plates for three 
days at 37°C were reisolated onto a second Cove plate and reinoculated into a microtiter 
well as before. The procedure of assay, reisolation and reinoculation was repeated once 
more after an additional three days of culture. 

The results, summarized below in Table 1, show the amount of lipase produced, rela- 
tive to the amount produced by the pENI1298 transformants six days after transformation. 
As indicated by the low values for the standard deviation, there is little variation in the li- 
pase expression level among the 30 independently grown transformants. Usually, when do- 
ing transformation of filamentous fungi, a much larger relative standard deviation is seen 
(70% -100%), due to both random integration into the genome of the vector and differences 
in numbers of vectors getting integrated. There can be 100-1000 fold difference in expres- 
sion levels between the worst and the best producing fungal transformant. 
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Table 1. The average expression levels of lipase from 30 independently grown transfor- 
mants relative to pENI1298 six days after transformation and the standard deviation for 
each. 

Example 3: Testing for rearrangment of the plasmid 

Transformants of Jal250 containing either pENI1298 or pENI1299 were grown in YPD 
medium overnight. 

DNA was prepared from each of the transformants using a QIAprep® Miniprep Kit 
(QIAGEN, Venlo, The Netherlands) in which the procedure provided by the manufacturer 
had been modified. Briefly, each strain was grown in 5 ml YPD for three days. The my- 
celia were collected by filtration and washed with 200 ml of water, then transferred to a 2 
ml microfuge tube and lyophilized by centrifugation under vacuum for three hours at 60°C. 
The dried mycelia was then ground and resuspended in one ml of lysis buffer (100 mM 
EDTA, 10 mM Tris pH8.0, 1 % tritonX-100, 500 mM guanidine-HCl, 200 mM NaCl), 
followed by thorough mixing. Twenty ^ig RNAse was added to each tube which was then 
incubated at 37°C for 10 min. One hundred ng proteinase K was added, and the reaction 
was incubated for 30 min. at 50°C. Each tube was then centrifuged for 15 min at top speed 
in a standard bench top microfuge. The supernatant was applied onto a QIAprep* spin col- 
umn, then centrifuged and filtrate discarded. The column was next washed in 0.5 ml PB 
provided in the kit, and centrifuged again for one minute. After the filtrate was discarded, 
the column was washed in 0.75 ml PE provided in the kit, then centrifuged once more for 
one minute. The column was allowed to air dry, and the DNA was eluted by addition of 
100 \xl TE buffer followed by a final one min spin. 

Transformants were obtained when the DNA was transformed into E. coli DH5ct, con- 
firming that the plasmids remained episomal in A. oryzae. Plasmid DNA was purified from 
the bacterial cells using QIAprep* Miniprep Kit (QIAGEN), according to the manufac- 
turer's instructions. 

The purified plasmid DNA was then digested with Seal, and the restriction pattern was 
analyzed by standard agarose gel fractionation techniques. 
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When compared to the restriction patttern of the original pi as mid, the results showed no 
rearrangement in five pENI1299 JaL250 transformants and only one in eight pENI1298 
JaL250 transformants, indicating the rarity of plasmid rearrangement events. 

Example 4: Screening a Library 

In order to identify variant polypeptides with improved functional characteristics ex- 
pressed at a level comparable to the parental polypeptide a library of variant polynucleotide 
sequences was constructed and screened in Aspergillus oryzae. Polymerase chain reactions, 
using pENI1298 as the template and 2 pmol/|al of each primer: oligo 19670 as one primer, 
and as a second primer, oligo 23701, 23702 or 23703, as described below, were run under 
the following conditions: 94°C, 5 min.; 30 cycles of (94°C, 30 sec.; 50°C, 30 sec.; 72°C, 1 
min.), and 72°C, 5 min. A commercial Taq polymerase, AmpliTaq, was used as recom- 
mended by the supplier (Perkin-EImer Corp., Branchburg NJ, USA). The sequence of 
19670 is shown below: 

19670: CCCCATCCTTTAACTATAGCG 

The sequence of the doped oligos are shown below: 

2370 1 : CGTGGACGGCCTTGGCTAGCCCTCCTCS ACS ACS ACS AGTCTCGC AGG AT- 
CTG 

23702: CTGCGTGGACGGCCTTGGCTAGCCCTCCTCSACSACSACSACSAGTCTCG- 

CAGGATCTGTTTAACCAG 
23703: TCTCTGCGTGGACGGCCTTGGCTAGCCCTCCTCSACSACSACSACSACSA- 

GTCTCGCAGGATCTGTTTAACCAG 

The resulting products were purified using microfiige spin columns S300 (Pharmacia- 
LKB, Uppsala SE). The purified products, including pENI1298 DNA, were then subjected 
to a second round of PGR amplification under the following conditions: 94°C, 5min.; 30 



31 



cycles of (94°C, 1 min.; 50°C, 10 min.; 72°C, 2 min.; and 72°C, 7 min., using the follow- 



ing primer: 



19671: CTCCCTTCTCTGAACAATAAACCC 



The products were next subjected to Dpnl digest to remove template DNA, and then 
incubated at 94°C for 30 min. to inactivate the Dpnl enzyme. 

JaL250 was transformed using 2 ng of pENI1298, which had been digested with Ball 
and SgrAl to remove most of the lipase encoding sequence, and 5 \ig of one of the PCR 
fragments originating from the 23701, 23702 or 23703 doped-oligo reactions. The vector 
and the PCR fragment were allowed to recombine in vivo as described in WO 97/07205. 
JaL250 cells were also transformed with the digested pENI1298 or each of the PCR prod- 
ucts alone. One transformant was obtained when the cells were transformed with the vector 
alone, and no transformants resulted from transformation with the PCR fragment alone. 

Fifteen, 12 and 26 transformants were inoculated from the 23701, 23702 and 23703 
libraries, respectively, into microliter plates containing 200 *il of l*vogel, 2% glucose me- 
dia and incubated for 72 hours. Jal250 transformants of pENI1298 were also inoculated as 
a control. The cultures were then streaked onto Cove plates. Lipase activity in the mi- 
crotiter cultures was assayed as described above in Example 2 and in a detergent containing 
assay in which 10 yil of the microtiter culture was added to 200 nl of a lipase substrate pre- 
pared using a commercial laundry detergent in microtiter wells. Activity was measured 
spectrophotometrically at 405 nm and calculated as described above in Example 2. 

Each of the transformants which showed activity in the detergent containing assay was 
reisolated on a Cove plate. From each of the Cove plates that had been incubated for 72 
hours at 37°C > two colonies were inoculated into a microtiter dish as described above, along 
with ten pENI1298 transformants, then cultured for 72 hours at 34°C. All the clones ex- 
hibited activity in the detergent assay, whereas the pENI1298 transformants did not. Fur- 
thermore, the transformants grown as independent duplicate cultures showed relatively 
similar levels of activity. The results are summarized below in Table 2. 
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Table 2. Lipase activity on pnp-butyrate and a commercial laundry detergent. 



Because clone 23701. 1 showed excellent expression as well as performance in both the 
5 pnp-butyrate and detergent assays, DNA was isolated in order to determine the deduced N- 
terminal sequence of the polypeptide, which was determined to be SPPRRPP. The natu- 
rally occurring N-terminal sequence is SPIRRE, which encodes a propeptide that is cleaved 
off by a kex2-like protease. DNA from some of the other transformants was sequenced, 
giving rise to the following deduced N-terminal sequences: 23701.2 (SPPRRRR), 23702. 1 
10 (SPPRPRRR), 23702.7 (SPPRPRRP), 23703.1 (SPPRRRRKP), and 23703.4 
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(SPPRRPRKR). Thus, a variant polypeptide with improved functional characteristics which 
can be expressed at a level comparable to the parental polypeptide was identified in a pro- 
duction host. 

The invention described and claimed herein is not to be limited in scope by the specific 
embodiments herein disclosed, since these embodiments are intended as illustrations of several 
aspects of the invention. Any equivalent embodiments are intended to be within the scope of 
this invention. Indeed, various modifications of the invention in addition to those shown and 
described herein will become apparent to those skilled in the art from the foregoing descrip- 
tion. Such modifications are also intended to fall within the scope of the appended claims. 

All publications mentioned herein are incorporated herein by reference for the purpose of 
describing and disclosing the particular information for which the publication was cited. The 
publications discussed above are provided solely for their disclosure prior to the filing date of 
the present application. Nothing herein is to be construed as an admission that the inventors 
are not entitled to antedate such disclosure by virtue of prior invention. 

It is to be understood that this invention is not limited to the particular methods and com- 
positions described as such may, of course, vary. It is also to be understood that the terminol- 
ogy used herein is for the purpose of describing particular embodiments only, and is not in- 
tended to be limiting since the scope of the present invention will be limited only by the ap- 
pended claims. 

Unless defined otherwise all technical and scientific terms used herein have the same 
meaning as commonly understood by one of ordinary skill in the art to which this invention 
belongs. Although any materials or methods similar or equivalent to those described herein 
can be used in the practice or testing of the present invention, the preferred methods and mate- 
rials are described. 

Example 5: Construction of Fusarium venenatum strain DLM15 
Strains 

Fusarium strain A3/5, now reclassified as Fusarium venenatum (Yoder and 
Christianson, 1998, Fungal Genetics and Biology 23: 62-80; O'Donnell et al, 1998, 
Fungal Genetics and Biology 23: 57-67), was obtained from Dr. Anthony Trinci, 
University of Manchester, Manchester, England, or from the American Type Culture 
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Collection, Manassas, VA, as Fusarium strain ATCC 20334. A morphological mutant of 
Fusarium venenatum A3/5 designated CC1-3 (Wiebe et al., 1992, Mycological Research 
96: 555-562; Wiebe et al. 9 1991, Mycological Research 95: 1284-1288; Wiebe et al. 9 
1991, Mycological Research 96: 555-562) is a highly branched, colonial variant. The strain 
5 used in this experiment MLY3 was derived from the Fusarium venenatum A3/5 
morphological mutant CC1-3. It has identical complementation characteristics to CC1-3. 

Media and Solutions 

Minimal medium was composed per liter of 6 g of NaN0 3 , 0.52 g of KC1, 1.52 g of 
10 KH 2 P0 4 , 1 ml of COVE trace metals solution, 1 g of glucose, 500 mg of MgS0 4 7H 2 0, 
342.3 g of sucrose, and 20 g of Noble agar at pH 6.5. 

RA sporulation medium was composed per liter of 50 g of succinic acid, 12.1 g of 
NaN0 3 , 1 g of glucose, 20 ml of SOX Vogels, and 0.5 ml of a 10 mg/ml NaMo0 4 stock 
solution, pH to 6.0. 

15 YEPG medium was composed per liter of 10 g of yeast extract, 20 g of peptone, 

and 20 g of glucose. 

STC was composed of 0.8 M sorbitol, 25 mM Tris pH 8, 25 mM CaCl 2 . 
SPTC was composed of 40% PEG 4000, 0.8 M sorbitol, 25 mM Tris pH 8, 25 
mM CaCl 2 . 

20 COVE trace metals solution was composed per liter of 0.04 g of NaB 4 O 7 10H 2 O, 0.4 

g of CuS0 4 5H 2 0, 1.2 g of FeS0 4 7H 2 0, 0.7 g of MnS0 4 H 2 0, 0.8 g of NajMoO^O, and 
10 g of ZnS0 4 7H 2 0. 

SOX COVE salts solution was composed per liter of 26 g of KC1, 26 g of 
MgS0 4 7H20, 76 g of KH 2 P0 4 , and 50 ml of COVE trace metals. 
25 COVE top agarose was composed per liter of 20 ml of SOX COVE salts, 0.8 M 

sucrose, 1.5 mM cesium chloride, 1.0 mM acetamide, and 10 g of low melt agarose, pH 
adjusted to 6.0. 

COVE medium was composed per liter of 342.3 g of sucrose, 20 ml of SOX COVE 
salt solution, 10 ml of 1 M acetamide, 10 ml of 1.5 M CsCl 2 , and 25 g of Noble agar. 
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SOX Vogels medium was composed per liter of 150 g of sodium citrate, 250 g of 
KH 2 P0 4 , 10 g of MgS0 4 7H 2 0, 10 g of CaCl 2 2H 2 0, 2.5 ml of biotin stock solution, and 5.0 
ml of AMG trace metals solution. 

VogelVacetamide agar was composed per liter of 30 g of sucrose, IX VogePs salts, 
5 10 mM acetamide, 15 mM CsCl, and 25 g of Noble agar. 

Fluoroacetamide medium was composed per liter of 12 g of sodium acetate, 2 g of 
sodium chloride, 0.5 g of MgS0 4 , 3 g of KH 2 P0 4 , 0.3 g of urea, 2 g of fluoroacetamide, 1 
ml of SOX Vogels salts, and 15 g of Agarose I (Amresco; Solon,Ohio), pH 6.1. 

10 Deletion of the pyrG gene in Fusarium venenatum MLY3 

An unmarked pyrG deletion in the genome of the Fusarium venenatum expression host 
MLY3 was generated by initially deleting the pyrG gene and replacing it with the Aspergillus 
nidulans amdS gene flanked by repeated DNA sequences isolated from the upstream region of 
the Aspergillus oryzae pyrG gene. Isolates cured of the amdS gene were then selected by 

15 growth on plates containing fluoroacetamide. 

Plasmid pDM156.2 was constructed by inserting a 3,9 kb genomic EcoKL pyrG 
fragment cloned from Fusarium venenatum strain ATCC 20334 into the EcdRl site of 
pUC118. The pyrG fragment contained the entire coding region plus 1.3 kb of the promoter 
and 1 .5 kb of the terminator (Figure 3). The native pyrG gene of Fusarium venenatum MLY3 

20 was replaced via homologous recombination by a 4.4 kb fragment from which the entire open 
reading frame of pyrG was deleted and replaced with the amdS gene flanked by 230 bp 
repeated sequences as shown in Figure 3 . 

pJRoy47 was constructed first by constructing pJRoy43, which is composed of 
pNEB193 (New England Biolabs) where the Pacl and Xbal sites were replaced with a Swal 

25 site. To accomplish this, a linker containing a BamHl site on the 5' end, a Swal site in the 
middle, and a SaR site on the 3' end was created by annealing the following two oligos 
together: 

5 • -G ATCG ATIT A A AT-3 1 
5 * -TCG A ATTTA A ATC-3 ' 
30 The resulting linker was ligated into pNEB193, which had been digested with BamHl and 
So/1, to create pJRoy43. 
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Next, a 230 bp region upstream of the Aspergillus oryzae pryG gene was chosen as a 
repeat sequence in Fusarium venenaium. This region was amplified as two different products 
from the Aspergillus oryzae pyrG plasmid pJaL335 (WO 98/12300) using the following two 
primer pairs, primers 3 and 4 and primers 5 and 6: 

5 Primer 3: S-CGAATTTCATATTTAAATGCCGACCAGCAGACGGCCCTCG-S' 
Primer 4: 5 '-GCGAT ATC ATGATCTCTCTGGT ACTCTTCG-3 ' 
Primer 5: 5 '-GCGAT ATC ATCGACCAGCAGACGGCCCTCG-3 ' 
Primer 6: 5-GCGTTTAAAC ATGATCTCTCTGGT ACTCTTCG-3 ' 

The amplification reactions (50 [il) were prepared using approximately 40-50 ng of 

10 genomic DNA, prepared using the DNeasy Plant Mini Kit, as the template. Each reaction 
contained the following components: 40-50 ng of genomic DNA, 50 pmol each of the primers 
3 and 4 or primers 5 and 6, 200 each of dATP, dCTP, dGTP, and dTTP, IX Taq DNA 
polymerase buffer, and 2.5 Units of Taq DNA polymerase. The reactions were incubated in a 
Pertdn-Elmer Model 480 Thermal Cycler programmed as follows: Cycle 1 at 94°C for 2.5 

15 minutes and 72°C for 2.5 minutes; cycles 2-26 each at 94°C for 45 seconds, 50°C for 45 
seconds, and 72°C for 2 minutes; and cycle 27 at 94°C for 45 seconds, 50°C for 45 seconds, 
and 72°C for 10 minutes. 

The reaction products were isolated on a 1% agarose gel where approximately 230 bp 
fragments were isolated. The first PCR product (approximately 230 bp fragment) contained a 

20 Swal site at the 5 f end and an EcoKV site at the 3' end while the second PCR product 
(approximately 230 bp fragment) contained an EcoKV site at the 5 % end and a Pmel site at the 
3* end. The two purified repeat fragments were first digested with EcoKV and ligated 
together. After purification (phenol-chloroform etraction and ethanol precipitation), the 
ligation product was then digested with Pmel and Swal to produce approximately a 500 bp 

25 fragment which was purified by agarose gel electrophoresis and agarase treatment. 

The resulting fragment was cloned into Pmel and Swal digested pJRoy43 to create 
pJRoy44 (Figure 4). This vector contains the two 230 bp repeats separated by an EcoKV 
site and flanked by Swal and Pmel sites. 

An EcoSI fragment containing the amdS gene and regulatory region was isolated 

30 from a subclone of p3SR2 (Kelly and Hynes, 1985, EMBO Journal 4: 475-479), made 
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blunt with the Klenow fragment, and ligated into EcoKV digested pJRoy44 to create 
pJRoy47 (figure 3). 

The amdS gene flanked by the 230 bp repeated sequences was obtained from pJRoy47 
as a SwaVPmel fragment and inserted into EcdRV/StuI digested pDM156.2 to create 
pDM222.A (Figure 3). pDM222. A was digested with EcoBJ and the 4.4 kb EcdRI fragment 
containing the pyrG deletion cassette was gel purified using Qiaquick Gel Extraction Kit 
(Qiagen, Chatsworth, CA) prior to transformation. 

Spores of Fusarium venenatum MLY3 were generated by inoculating a flask 
containing 500 ml of RA sporulation medium with three 1cm 2 mycelia plugs from a 
minimal medium plate and incubating at 28°C, 150 rpm for 2 to 3 days. Spores were 
harvested through Miracloth (Calbiochem, San Diego, CA) and centrifuged 20 minutes at 
7000 rpm in a Sorvall RC-5B centrifuge (E. I. DuPont De Nemours and Co., Wilmington, 
DE). Pelleted spores were washed twice with sterile distilled water, resuspended in a small 
volume of water, and then counted using a hemocytometer. 

Protoplasts were prepared by inoculating 100 ml of YEPG medium with 4 X 10 7 
spores of Fusarium venenatum MLY3 and incubating for 16 hours at 24°C and 150 rpm. 
The culture was centrifuged for 7 minutes at 3500 rpm in a Sorvall RT 6000D (E. I. 
DuPont De Nemours and Co., Wilmington, DE). Pellets were washed twice with 30 ml of 
1 M MgS0 4 and resuspended in 15 ml of 5 mg/ml of NOVOZYME 234™ (batch PPM 
4356, Novo Nordisk A/S, Bagsvard, Denmark) in 1 M MgS0 4 . Cultures were incubated 
at 24 °C and 150 rpm until protoplasts formed. A volume of 35 ml of 2 M sorbitol was 
added to the protoplast digest and the mixture was centrifuged at 2500 rpm for 10 minutes. 
The pellet was resuspended, washed twice with STC, and centrifuged at 2000 rpm for 10 
minutes to pellet the protoplasts. Protoplasts were counted with a hemocytometer and 
resuspended in an 8:2:0.1 solution of STC:SPTC:DMSO to a final concentration of 1.25 X 
10 7 protoplasts/ml. The protoplasts were stored at -80°C, after controlled-rate freezing in a 
Nalgene Cryo 1°C Freezing Container (VWR Scientific, Inc., San Francisco, CA). 

Frozen protoplasts of Fusarium venenatum MLY3 were thawed on ice. One jig of 
the 4.4 kb EcoRI fragment from pDM222. A described above and 5 ill of heparin (5 mg per 
ml of STC) were added to a 50 ml sterile polypropylene tube. One hundred \xl of 
protoplasts was added, mixed gently, and incubated on ice for 30 minutes. One ml of 
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SPTC was added and incubated 20 minutes at room temperature. After the addition of 25 
ml of 40°C COVE top agarose plus 10 mM uridine, the mixture was poured onto a 150 mm 
diameter plate containing COVE agar to select for integration of the amdS gene. 

The native 3.9 kb EcoRI fragment of the pyrG gene of Fusarium venenatum MLY3 
was replaced via homologous recombination by the 4.4 kb fragment containing pyrG flanking 
sequence and the amdS gene flanked by 230 bp repeated sequences. This resulted in the 
deletion of the entire pyrG coding region plus 0.78 kb of 5' untranslated and 0.8 kb of the 3* 
untranslated regions. Replacement of the native pyrG gene with the deletion cassette was 
confirmed by Southern hybridization. Southern hybridizations were conducted using the 
Amersham (Arlington, IL) Vistra and Rapid Hyb protocols or the Boehringer Mannheim DIG 
System protocol provided by the manufacturer. Fungal genomic DNA of putative deletants 
was prepared using the DNeasy Plant Mini Kit (Qiagen Chatsworth, CA) and digested with 
EcoRI. Southern blots were made using Magnagraph membrane (Micron Separations, Inc; 
Westborough, MA) and standard molecular biology techniques. Membranes were developed 
following the manufacturer's instructions and scanned using a STORM 860 (Molecular 
Dynamics, Sunnyvale, CA) for fluorescein-Iabeled probes or exposed to film for DIG-labeled 
probes. The 3.2 kb pyrG probe contained the entire pyrG coding region, 0.64 kb 5' region, 
and 1.42 kb 3 * untranslated region. 

A deletant strain was sporulated in RA medium plus 10 mM uridine for approximately 
3 days at 28°C, 150 rpm and single spores were isolated using a micromanipulator. Spore 
isolates were grown on VogePs/acetamide agar plus 10 mM uridine. One spore isolate was 
sporulated in RA medium plus 10 mM uridine. The spore culture was filtered through 
Miracloth to remove mycelia. Selection for loss of the amdS marker occurred by spreading 9.5 
x 10 5 spores of the confirmed pyrG deletant onto each of five 150 mm plates containing 
fluoroacetamide medium plus 10 mM uridine. These plates were incubated at room 
temperature for up to 2 weeks. Resulting colonies were subcultured onto COVE plates and 
fluoroacetamide plates plus lOmM uridine. Colonies that grew well on fluoroacetamide and 
poorly or not at all on COVE were analyzed further. Spore isolates from three of these strains 
were subjected to Southern blot analysis. Genomic DNA was cut with EcoRI and probed with 
the pyrG probe described above. Several of the spore isolates yielded a 1.4 kb hybridizing 
band indicating an amdS "loop-out". One spore isolate was chosen and was designated 
Fusarium venenatum DLM 15. 
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Example 6: Expression Levels of Lipase in Independently Grown Fusarium venenatum 
DLM15 Transfonnants 

The pyrG minus Fusarium venenatum strain DLM15 described in Example 5 was trans- 
formed with pENI1298 (described in example 2) using standard procedures, as described in 
Royer et al. 1995, Biotech. 13 1479-1483, replacing acetamide with sodium nitrate (25 
mM). The cells were then cultured on minimal plates at room temp. Transformants ap- 
peared after 2 weeks' incubation at a transformation frequency of 50 - 100 /\xg DNA, an 
increase of 10-100 fold over the transformation frequency in the absence of a AMA1 se- 
quence. 20 transformants were isolated on minimal media plates and grown for additional 2 
weeks at room temp. The transformants were inoculated in a micotiter dish in 200 |il vogel 
media (as in example 2) either supplemented with 20 mM uridine or without uridine, and 
grown at room temp. Transformants supplemented with uridine grew well, but were not 
able to grow when isolated on minimal media plates without uridine, indicating that the 
plasmid containing the pyrG gene had been lost, and thus that the plasmid was autono- 
mously replicating. No lipase activity could be detected from transformants grown in 
uridine supplemented media. Lipase activity could be detected in the media from tranfor- 
mants growing in minimal media not supplemented with uridine. The average arbitrary ac- 
tivity was 34.8 with a relative standard deviation of 15 % when comparing independently 
grown Fusarium transformants. Thus this plasmid can potentially be used for the creation of 
libraries in Fusarium species. 
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Claims 

1. A method of constructing and selecting or screening a library ofpolynucleotide se- 
quences of interest in filamentous fungal cells wherein the method comprises: 

(a) transforming the fungal cells with a population of DNA vectors, wherein each 
vector comprises: 

(i) a gene encoding a fungal selection marker and a fungal replication initiating 
sequence wherein the marker and the replication initiating sequence do not 
vary within the population; and 

(ii) a polynucleotide sequence of interest wherein the population of DNA vectors 
contains more than one variant of the polynucleotide sequence; 

(b) cultivating the cells under selection pressure; 

(c) selecting or screening for one or more transformants expressing a desired charac- 
teristic; and 

(d) isolating the transformant(s) of interest. 

2. The method according to claim 1, wherein the library of polynucleotide sequences of in- 
terest is prepared by random mutagenesis or naturally occurring allelic variations of at 
least one parent polynucleotide sequence having or encoding a biological activity or 
function of interest. 

3. The method of claim 1 or 2, wherein the polynucleotide sequence encodes a polypeptide 
or is a control sequence; or wherein the polynucleotide sequence encodes a polypeptide 
or part thereof and further comprises a control sequence involved in the expression of the 
polypeptide or a part of such control sequence. 

4. The method according to claim 3, wherein the polypeptide is a hormone, an en2yme, a 
receptor or a portion thereof, an antibody or a portion thereof, or a reporter, or a regulatory 
protein. 
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5. The method f claim 4, wherein the enzyme is an oxidoreductase, a transferase, a hy- 
drolase, a lyase, an isomerase, or a ligase. 

6. The method according to any of claims 4 or 5, wherein the enzyme is an aminopepti- 
dase, amylase, carbohydrase, carboxypeptidase, catalase, cellulase, chitinase, cutinase, 
cyclodextrin glycosyltransferase, deoxyribonuclease, esterase, alpha-galactosidase, 
beta-galactosidase, glucoamyiase, alpha-glucosidase, beta-glucosidase, invertase, lac- 
case, lipase, mannosidase, mutanase, oxidase, a pectinolytic enzyme, peroxidase, phy- 
tase, polyphenoloxidase, a proteolytic enzyme, ribonuclease, transglutaminase, or xyla- 
nase. 

7. The method according to claim 1 or 2, wherein the control sequence is an enhancer se- 
quence, a leader sequence, a polyadenylation sequence, a propeptide sequence, a pro- 
moter, a replication initiation sequence, a signal sequence, a transcriptional terminator 
or a translational terminator. 

8. The method of claim 7, wherein the promoter is derived from the gene encoding As- 
pergillus oryzae TAKA amylase, NA2-tpi and Aspergillus niger or Aspergillus awamori 
glucoamyiase. 

9. The method according to any of claims 1-8, wherein the selective marker is selected 
from the group of genes which encode a product which provides for resistance to bio- 
cide or viral toxicity, resistance to heavy metal toxicity, or prototrophy to auxotrophs. 

10. The method of claim 9, wherein the prototrophy is obtained from an enzyme selected 
from the group of metabolic pathways consisting of nucleotide synthesis, cofactor syn- 
thesis, amino acid synthesis, acetamide metabolism, proline metabolism, sulfate me- 
tabolism, and nitrate metabolism. 

11. The method of claim 9, wherein the selective marker is a gene selected from the group 
consisting of argB (ornithine carbamoyltransferase), amdS (acetamidase), bar 
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(phosphinothricin acetyltransferase), hemA (5-aminolevulinate synthase), hemB 
(porphobilinogen synthase), hygB (hygromycin phosphotransferase), niaD (nitrate re- 
ductase), prn (proline permease), pyrG (orotidine-S'-phosphate decarboxylase), pyroA, 
riboB, sC (sulfate adenyltransferase), and trpC (anthranilate synthase). 

12. The method of any of claims 1 to 11, wherein the replication initiating sequence is a 
nucleic acid sequence selected from the group consisting of: 

(a) a replication initiating sequence having at least 50% identity with the nucleic acid se- 
quence of SEQ ID NO: 1 or SEQ ID NO:2 and is capable of initiating replication; 

(b) a replication initiating sequence which hybridises under low stringency conditions with 
(i) the nucleic acid sequence of SEQ ID NO: 1 or SEQ ID NO:2, or (ii) the respective 
complementary strands, wherein the low stringency conditions are defined by prehy- 
bridization and hybridization at 42°C in 5x SSPE, 0.3% SDS, 200 jig/ml sheared and 
denatured salmon sperm DNA, and 25% formamide, and wash conditions are defined 
at 50°C for 30 minutes in 2x SSC, 0.2% SDS; and 

(c) a subsequence of (a) or (b), wherein the subsequence has replication initiating activity. 

13. The method of claim 12, wherein the nucleic acid sequence has at least 50% identity, more 
preferably about 60%, even more preferably about 70%, even more preferably about 
80%, even more preferably about 90%, and most preferably about 97% identity with 
the nucleic acid sequence of SEQ ID NO: 1 or SEQ ID NO:2. 

14. The method of claim 12, wherein the replication initiating sequence is obtained from a 
filamentous fungal cell. 

15. The method of claim 14, wherein the filamentous fungal cell is a strain of Aspergillus. 

16. The method of claim 15, wherein the strain of Aspergillus is obtained from a strain of 
A. nidulans. 
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17. The method of any of claims 12 to 16, wherein the replication initiating sequence has 
the nucleic acid sequence set forth in SEQ ID NO:l or SEQ ID NO: 2, or is a respective 
functional subsequence thereof. 

18. The method of claim 2, wherein the modification of the parent polynucleotide sequence 
is performed by mutagenesis, preferably random mutagenesis, by use of a physical or 
chemical mutagenizing agent, use of a doped oligonucleotide, DNA shuffling, or by 
subjecting the nucleic acid sequence to PCR generated mutagenesis, or use of any com- 
bination thereof. 

19. The method of claim 18, wherein the polynucleotide sequences of interest are obtained 
by in vivo recombination between two or more homologous nucleic acid sequences en- 
coding a polypeptide or a regulatory sequence, or any combination of both, compris- 
ing: 

(a) identifying at least one conserved region between the polynucleotide sequences of 
interest; 

(b) generating fragments of each of the polynucleotide sequences of interest, wherein 
said fragments comprise the conserved region(s) of (a); and 

(c) recombining the fragments of (b) by using the conserved region(s) as (a) homolo- 
gous linking point(s). 

20. The method according to any of claims 1-19, wherein the filamentous fungal cell trans- 
formed with the population of DNA vectors is a cell of a strain of Acremonium, Asper- 
gillus^ Coprinus, Fusarium, Humicola, Mucor, Myceliopthora, Neurospora, Penicil- 
lium, Thielavia, Tofypocladium or Trichoderma. 

21. The method according to claim 20, wherein the cell is an Aspergillus oryzae, Aspergil- 
lus niger, Aspergillus nidulans, Coprinus cinereus, Fusarium oxysporum, or Trichoderma 
reesei cell. 
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22. A method of constructing and screening or selecting a library of polynucleotide se- 
quences of interest in a filamentous fungal cell, wherein the method comprises: 

(a) transforming a culture of bacterial or yeast cells with a population of the DNA 
vectors as described in any of claims 1-21, wherein the vector further comprises a 
nucleic acid sequence encoding a bacterial or yeast selective marker and a bacterial 
or yeast replication initiating sequence; 

(b) cultivating the bacterial or yeast cells under selection pressure; 

(c) isolating the DNA constructs from the transformants of (b); 

(d) transforming filamentous fungal cells with the DNA constructs of (c); 

(e) cultivating the filamentous fungal cells of (d); 

(f) selecting or screening for one or more filamentous fungal transformants expressing 
a desired characteristic; and 

(g) isolating the filamentous fungal transformant(s) of interest. 

23. The method of claim 22, wherein the bacterial or yeast selective marker is selected 
from the group of genes which encode a product which provides for resistance to bio- 
cide or viral toxicity, resistance to heavy metal toxicity, or prototrophy to auxotrophs. 

23, Use of a fungal replication initiating sequence in the construction of a library of 
polynucleotide sequences of interest. 

24. The use according to claim 23, wherein the fungal replication initiating sequence is a 
nucleic acid sequence selected from the group consisting of: 

(a) a replication initiating sequence having at least 50% identity with the nucleic acid se- 
quence of SEQ ID NO: 1 or SEQ ID NO:2 and is capable of initiating replication; 

(b) a replication initiating sequence which hybridises under low stringency conditions with 
(i) the nucleic acid sequence of SEQ ID NO:l or SEQ ID NO:2, or (ii) the respective 
complementary strands, wherein the low stringency conditions are defined by prehy- 
bridization and hybridization at 42°C in 5x SSPE, 0.3% SDS, 200 ng/ml sheared and 
denatured salmon sperm DNA, and 25% formamide, and wash conditions are defined 
at 50°C for 30 minutes in 2x SSC, 0.2% SDS; and 
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(c) a subsequence of (a) or (b), wherein the subsequence has replication initiating activity. 

25. The use according to claim 24, wherein the replication initiating sequence is obtained 
from a filamentous fungal cell, in particular from a strain of Aspergillus, such as A. 

5 nidulans. 

26. The use according to claim 24 or 25, wherein the replication initiating sequence has the 
nucleic acid sequence set forth in SEQ ID NO:l or SEQ ID NO:2, or is a respective 
functional subsequence thereof. 

10 

27. A library of polynucleotide sequences of interest which library comprises filamentous 
fungal cells tranformed with a population of DNA vectors, wherein each vector com- 
prises: 

(i) a gene encoding a fungal selection marker and a fungal replication initiating sequence 
15 wherein the marker and the replication initiating sequence do not vary within 

the population; and 

(ii) a polynucleotide sequence of interest wherein the population of DNA vectors contains 
more than one variant of the polynucleotide sequence. 

20 28. The library according to claim 27, wherein the vector further comprises a nucleic acid 
sequence encoding a bacterial or yeast selective marker and a bacterial or yeast replication 
initiating sequence. 
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SEQ ID NO 1 



aagcttattttttgtatactgttttgtgatagcacgaa^ 

cttacctacatcaaattaataagagactaattataaactaaacacacaagcaagctactttag 
gcttttgacgtataaacgttgcttgtattta^ 

atctcaggtttgcgttaaactaaatcagagcccgagaggttaacagaacctagaaggggactagatatccgggtagggaa 

acaaaaaaaaaaaacaagacagccacatattagggagactagttagaagctagttccaggactaggaa^ 

tgataccaragtctagttgacaactagatagattctagattg^ 

tactagttagtatctagtctcctataa^ 

aaacggagtgattgcaaggtgttcagagactagttattgactagtcagtgactagcaataactaacaaggtattaaccta 
ccatgtctgccatcacxctgcacttcctcgggctra^ 

gactgtgactagtcaaagactagtccagaaccacaaaggagaaatgtcttaccactttcttcattgcttgtctcttttgc 
attatc^tgtctgcaactagttagagtctagttagtgactagtccgacgaggacttgcttgtct^ 
aactctccagggcctcaagatccacaacagagccttctagaagactggtcaataactag 
cttacgaggttgcatactcgctccctttgcct^ 

cacacggtgctgagactagttagaatctagtccc^uiactagcttggatagcttaccttt^^ 

cttgcagggtatggttcctttctraccagctgatttagctgccttgctaccctcacggc^^ 

tagaggttataaattagcactgatcctaggtacggggctgaatgtaacttgcctttcctttctc^tcg 

ggcttgctc^aattcctaccagtcacaggggtatgcacggcgtacggaccacttgaac^ 

tagtctgrattgaatggctgtacttacgggccctc^^ 

tagttagtgactagtc^ggactagttgaaatgggagaagaaactracg^^^ 

ggacttggtgctgctatatattagctaatacactagttagac^ 

ggagtcggggttgggagaacagtgccttcaaacaagccttcataccatgctacttgactagtcagg 

taatctagataggacttgcctttggccterat^ 

gtgggagttcttgtc^ttcaagtgcttgaccaatatgtttrt^ 

cagaaactatgatagcagtagactcactgtacgcttgaggcatcccttcactcggcagtagacttcatat^ 

caggcacgccattgtcgtcctgtggactagtcagtaactaggcttaaagctagtcgggtcggctta^ 

ggcagcgtaagctccccgtccttaactgccte^ 

gcgaatgctcggcatactaactgttgactagtcttggacto^^ 

agagtgagacaaagagcgaaataagagcttcaaaggctatctc^ 

gatttaaagagamcgactagttatgtacctggagtttggatataggaatgtgttgtggtaacgaaatgtaaggggga^ 
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gaaagaaaaagtcgtc^gaggtaactctaagtcggccattcctttttgggaggcgctaaccataaacggcatggtcgac 

ttagagttagctcagggaatttagggagttatctgcgaccaccgaggaacggcggaatgccaaagaa 

ctagctggcggttgac^ccccaccttttggcgtttctgcggcgttgcaggcgggactggatacttcgtagaaccagaaa 

ggcaaggcagaacgcgctcagcaagagtgttggaagtgatagcatgatgtgccttgttaactaggtaccaatctgcagta 

tgcttgatgttatccaaagtgtgagagaggaaggtccaaacatacacgattgggagagggcctaggtataagagtttttg 

agtagaacgcatgtgagcccagccatctcgaggagattaaacacgggccggcatttgatggctatgttagtaccccaatg 

gaaacggtgagagtccagtggtcgcagataactccx^taaattccctgagctaactctaagtcgacc^ 

tagcgcctcccaaaaaggaatggccgacttagagttacctcttgacgactttttctttcctcccccttacatttcgttac 

cacaararattcctatatccaaactccaggtaca^ 

accttcatactgcttagagatagcctttgaagctcttatttcgctctttgtctcactctcaccttcctcctcctcctcca 
atcctttttgctcgggactagtcc^gactagtcaacagttagtatgccgagcattcgcgataacgata^ 
aagtccccagagtactgtcactatctcgaggcagttaaggacggggagcttacgctgccggatttcaagatagtaagccg 
acccgactagctttaagcctagttactgactagtrcacaggac^ 

actgccgagtgaagggatgcctcaagcgtacagtgagtctactgctatcatagtttctgactagttattaactagttgac 

aggttccctctgccaacagaaacatattggtcaagcacttgaaggacaagaactcccacggcatgga 

aatggtcctcccactatgaaggaactgatggaggccaaagg 

agtcaagtagc^tggtatgaaggcttgtttgaaggcactgttctcccaaccccgactcctaccaagaagcgcaagcgagc 

tgcgtaagtttctgtgagtctaactagtglattagctaatatatagcagcaccaagtccaaggaccacaatactaaggga 

gtcgagaattcgtgagmcttctcccatttcaactagtcct^ 

gctggaaatgatcaggacaatggcgagggcccgtaagtacagccattcaatgcagactagttgctaactaatctgtgact 
agttcaagtggtccgtacgccgtgcatacccctgtgactggtaggaatttgagcaagcctgtcttgccgcgcgatgagaa 
aggaaaggcaagttacattcagccccgtacctaggatcagtgcto 

gccgtgagggtagcaaggcagctaaatcagctggtgagaaaggaaccataccctgcaagacctgtcgcaacgcaaagggc 

aaaggtaagctatccaagctagtttgggactagattctaactagtctcagcaccgtgtggttcaaagccatattgcgag 

tttggcgctttttctcatcgattgacgaggcaaagggagcgagtatgcaacctcgtaagtcagactcagacaaagaccaa 

ctagttattgaccagtcttctagaaggctctgttgtggatcttgaggccctggagagttcctccaacaatccggagacaa 

gcaagtcctcgtcggactagtcactaactagactctaactag 

gaaagtggtaagacatttctcctttgtggttctggactagtctttgactagtcacagtcttaaacaag^ 
gaaaatgaggaggaaaaggctgctgagcccgaggaagtgcagggtgatggcagaca^ 
attgctagtcactgactagtcaataactagtctctgaacaccttgcaatcactccgtU^ 
gataatagtagtaagttattctagcttcagagttataggagactagatactaactagtattagttgcaactaacctggat 
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ctcagagactttggcctcaatctagaatctatctagttgtcaactagactgtggtatcattgtcttttattttcctagtc 
ctggaactagcttctaactagtctccctaatatgtggctgtcttgttttttttttttgtttccctacccggatatctagt 
ccccttctaggttctgttaacctctcgggctctgatttagtttaacgcaaacctgagattagtttctaactagtctctag 
gttttctatcc^cctttaattgtaataataaatacaagcaacgtttatacgtcaaaagcatttataaacttttaccct^ 
5 agtagcttgcttgtgtgtttagtttataattagtctcttatmtttgatgtaggtaagcccgccacaaatatatatttt 
tacaagataccgtggaaaaacttcgtgctatcacaaaacagtatacaaaaaataagctt 

SEQ ID NO 2 

10 aagcttattttttgtatactgttttgtgatagracgaagtttttc^ 

cttacctacatcaaattaataagagactaattataaactaaacacacaagcaagctactttagggtaaaagtt^ 
gcttttgacgtataaacgttgcttgtatttattattacaattaaaggtggatagaaaacctagagactagttagaaacta 
atctcaggtttgcgttaaactaaatcagagcccgagaggttaacagaacctagaaggggactagatatccgggtagggaa 
acaaaaaaaaaaaacaagacagccacatattagggagactagttagaagctagttccaggactaggaaaataaaaga^ 

15 tgataccacagtctagttgac^ctagatagattctagattgaggccaaagtctctgagatccaggtt^ 
tactagttagtatctagtctcctataactctgaagctagaataac^ 

aaacggagtgattgc^ggtgUcagagactagttaUgactagtcagtgactagcaataactaaca^ 
ccatgtctgccatcaccctgcacttcctcgggctcag 

gactgtgactagtcaaagactagtccagaaccacaaaggagaaatgtcttaccactttcttcattgcttgtct 
20 attatccatgtctgcaactagttagagtctagttagtgactagtccgacgaggacttgcttgtctccg 
aactctcx^gggcctcaagatccacaacagagccttctagaagactggtcaataactagttg^ 
cttacgaggttgcatactcgctccctttgcctc^^ 

cacacggtgctgagactagttagaatctagtcccaaactagcttggatagcttacctttgcc 
cttgcagggtatggttcctttctcacx^gctgatttag^ 
25 tagaggttataaattagc^ctgatcctaggtacggggctgaatgtaacttgcxtttcctttctcatcgcgcggcaagaca 
ggcttgctcaaattcctaccagtcacaggggtatgcacggcg 

A 

tagtctgc^ttgaatggctgtacttacgggccctcgccattgtcctgatcatttccagcttcaccctcgttgctgcaaag 
tagttagtgactagtcaaggactagttgaaatgggagaagaaactcacgaattctcgactcccttagtattgtggtcctt 
ggacttggtgctgctatatattagctaatacactagttagactcac^gaaacttacgcagctcgcttg^ 
30 ggagtcggggttgggagaacagtgccttcaaacaagc^ttcatac^ 

taatctagataggacttgcctttggcx:tccatcagttccttcatagtgggaggaccattgtgc^ 
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gtgggagttcttgtccttcaagtgcttgaccaat^ 

cagaaacUtgatagcagtagactcactgtacgcttgaggcatcccttcactcggcagtagacttcatatggatggatat 
caggcacgccattgtcgtcctgtggactagtcagtaactaggcttaaagctagtcgggtcg 
ggcagcgtaagctccccgtccttaactgcctcgagatagtgacagtactctggggactttcggagatcgttatcgttatc 
5 gcgaatgctcggcatactaactgttgactagtcttggacto^ 

agagtgagacaaagagcgaaataagagcttcaaaggctatctctaagcagtatgaaggttaagtatctagttcttgacta 

gatttaaagagatttcgactagttatgtacctggagtttggatataggaatgtgttgtggtaacgaaat^ 

gaaagaaaaagtcgtcaagaggtaactctaagtcggccattcctttttgggaggcgctaaccataaacggcatggtcgac 

10 
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